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ABSTRACT 


In  this  thesis  we  investigate  the  problem  of  obtaining  solutions  for  large, 
separable  queueing  network  models  of  computer  systems.  To  this  end,  we  begin 
with  a  detailed  analysis  of  the  time  and  space  requirements  of  the  two  major 
exact  solution  techniques.  Subsequently,  we  develop  three  approximations 
based  on  aggregation  of  network  components:  aggregation  of  classes,  aggrega¬ 
tion  of  customers,  and  aggregation  of  service  centers. 

Each  approximation  technique  reduces  the  cost  of  solving  the  network 
(approximately)  by  reducing  the  number  of  some  component  in  it  that  contri¬ 
butes  to  the  high  cost  of  solution.  In  general,  the  approximations  transform  a 
large  separable  network  into  a  smaller,  also  separable  network  that  reflects  the 
behavior  of  the  original  network.  The  exact  solution  for  this  reduced  model  is 
then  obtained  using  standard  techniques,  and  estimates  of  the  performance 
measures  of  the  full  network  are  derived  from  those  of  the  reduced  network. 

Wc  present  with  each  technique  an  explanation  of  its  relation  to  the  state 
space  of  the  full  network.  This  formal  study  is  augmented  by  example  solutions 
using  the  approximations  for  representative  sets  of  models  to  gain  insight  into 
the  error  behavior  of  the  techniques. 

Finally,  we  discuss  an  alternative  approximation  technique  not  based  on 
aggregation.  A  comparison  of  all  techniques  is  given,  with  the  problem  of  select¬ 
ing  an  appropriate  technique  for  a  given  network  in  mind. 
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CHAPTER  1  -  PROBLEM  OVERVIEW 


As  computer  software  and  hardware  technology  has  advanced,  each  com¬ 
puter  system  component  has  become  more  complicated.  For  example,  secon¬ 
dary  storage  technology  has  moved  from  punched  tape  through  magnetic  tape, 
magnetic  disks,  rotational  position  sensing  disk  systems,  and  shared  DASD,  with 
associated  increases  in  both  performance  and  complexity  of  the  storage  devices 
at  each  step.  At  the  same  time  central  processing  units  have  progressed  from 
uniprogramming  systems  to  multiprogramming,  multiprocessing,  virtual 
memory  systems.  The  performance  prediction  problem  for  modern  computer 
systems  has  been  made  more  difficult  both  because  of  increased  complexity  of 
these  components  and  because  of  increased  complexity  in  component  interac¬ 
tion,  since  these  interactions  are  a  significant  factor  in  overall  performance. 

Coupled  with  the  improvements  in  the  performance  of  computer  systems 
has  been  an  exponential  growth  in  the  workload  of  the  typical  system,  with  the 
expectation  that  this  trend  will  continue.  These  two  factors,  the  growing  com¬ 
plexity  of  computer  systems  and  the  rapidly  growing  number  of  tasks  that  they 
are  being  used  to  perform,  make  it  increasingly  important  that  quantitative 
tools  be  used  to  aid  system  capacity  and  configuration  planning.  The  intuition  of 
even  a  very  good  systems  programmer  is  no  longer  adequate  to  predict  reliably 
how  changes  in  the  workload  or  system  configuration  will  affect  overall  system 
performance. 

Because  of  this  need  for  quantitative  tools  to  aid  in  performance  prediction, 
much  work  has  been  done  recently  on  applying  queueing  network  models  to 
computer  systems.  These  models  are  fairly  naturally  applied,  as  the  major  fac¬ 
tors  affecting  the  performance  of  a  computer  system  (the  hardware  devices 
comprising  the  system  and  the  workload  placed  on  it)  have  direct  counterparts 
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in  queueing  models.  In  the  simplest  sense,  a  queueing  network  model  is  merely 
a  tool  to  evaluate  the  impact  on  system  performance  of  competition  for 
resources  by  the  jobs  running  on  the  system. 

Queueing  network  models  consist  of  two  basic  entities:  service  centers  and 
customers.  A  service  center  represents  some  component  of  the  computer  sys¬ 
tem  that  delivers  service  to  the  jobs  running  on  the  system.  Typical  examples 
are  CPU’s  and  disk  I/O  devices.  A  job  in  a  computer  system  that  requires,  say, 
some  CPU  processing  must  compete  with  other  jobs  for  the  use  of  that  facility. 
The  progress  of  a  job  through  the  system  is  thereby  delayed  both  by  the  time 
spent  queueing  for  the  CPU  and  the  time  spent  actually  being  processed  by  the 
CPU.  Exactly  this  same  behavior  occurs  in  the  queueing  network  model:  there  is 
a  delay  associated  with  both  the  queueing  time  and  the  actual  service  time.  In 
the  simplest  (and  most  common)  models,  each  hardware  component  of  the  com¬ 
puter  system  is  represented  by  a  single  service  center  in  the  queueing  network 
model. 

Each  job  in  the  computer  system  is  a  competitor  for  the  hardware  devices 
that  might  interfere  with  or  be  interferred  with  by  the  attempts  of  other  jobs  to 
obtain  service  at  the  same  devices.  In  a  queueing  model  these  competitors  are 
called  customers.  Generally,  there  is  one  competitor  (customer)  in  the  queue¬ 
ing  network  model  for  each  competitor  (job)  in  the  computer  system. 

One  of  the  essential  components  of  the  modeling  process  is  to  simplify  the 
actual  system  by  abstracting  out  insignificant  detail,  and  retaining  only  those 
characteristics  of  the  system  that  are  significant  to  the  study  being  peformed. 
One  way  that  this  simplification  is  done  when  applying  queueing  network  models 
is  to  assume  that  many  jobs  are  essentially  equivalent  as  far  as  their  effect  on 
system  performance  is  concerned.  These  groups  of  "equivalent"  jobs  are 
represented  by  a  single  class  of  customers  in  the  queueing  model.  All  custo- 
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mers  of  the  model  in  a  single  class  are  statistically  indistinguishable.  In  the 
simplest  modeling  studies  all  jobs  of  the  computer  system  are  assumed  to  be 
equivalent,  and  a  queueing  model  with  only  one  class  of  customers  is  used  to 
model  them.  However,  there  are  two  reasons  for  complicating  the  model  by 
introducing  additional  classes.  The  first  is  that  that  workload  on  the  computer 
system  may  naturally  be  partitioned  into  a  number  of  groups  such  that  jobs 
within  each  group  are  indeed  similar,  while  jobs  from  distinct  groups  are  quite 
different.  For  instance,  a  measurement  of  an  actual  system  may  show  one  group 
of  CPU  bound  jobs  and  another  group  of  I/O  bound  jobs.  This  situation  generally 
implies  that  more  accurate  predictions  can  be  obtained  from  the  queueing 
model  by  preserving  these  distinctions  using  customer  classes. 

The  other  reason  to  introduce  multiple  classes  into  the  model  is  that  one 
may  be  interested  in  obtaining  output  statistics  from  the  model  that  are  specific 
to  each  distinct  component  of  the  system  workload.  For  instance,  on  a  system 
with  both  batch  and  interactive  processing  it  might  be  projected  that  the 
interactive  portion  of  the  workload  will  double  in  the  next  two  years,  while  the 
batch  portion  will  increase  only  50%.  A  system  analyst  might  be  interested  in 
the  effect  on  interactive  response  time  of  this  growth  in  the  workload.  In  this 
case  there  is  a  natural  partitioning  of  the  workload  into  two  classes,  batch  and 
interactive,  both  so  that  each  component  can  be  altered  independently  to 
reflect  projected  workloads  and  because  the  output  statistics  desired  from  the 
model  should  pertain  to  one  segment  of  the  workload  only. 

The  successful  application  of  queueing  network  models  has  been  demon¬ 
strated  by  a  growing  number  of  case  studies  (see  [Kienzle  and  Sevcik  79]  and 
[Graham  78]  for  examples).  Most  of  these  studies  have  been  for  centralized  sys¬ 
tems,  and  have  required  models  with  only  a  few  service  centers,  customers,  and 
classes.  However,  the  current  trend  is  for  computer  systems  to  be  decentral- 
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ized. 

This  trend  toward  decentralization  has  produced  systems  with  a  large 
number  of  hardware  components  that  are  generally  intended  to  serve  a  large 
user  community.  Such  increases  in  both  the  number  of  components  and  the  size 
of  the  workload  translate  directly  into  increases  in  the  number  of  service 
centers  and  customers  in  the  queueing  networks  used  to  represent  these  sys¬ 
tems. 

Growth  in  the  size  of  the  workload  is  exemplified  by  computer  communica¬ 
tions  networks,  distributed  data  base  systems,  banking  systems,  airline  reserva¬ 
tion  systems,  and  point-of-sale  systems.  In  each  case  hundreds  or  thousands  of 
terminals  may  be  active  on  the  system  at  a  time.  Since  each  active  terminal 
user  represents  a  competitor  for  the  resources  of  the  system  being  accessed, 
the  user  is  generally  represented  by  a  single  customer  in  a  queueing  network 
model.  This  means  that  there  are  potentially  thousands  of  customers  in  the 
models  of  these  systems.  This  is  a  substantial  increase  over  queueing  network 
models  representing  more  centralized  systems,  where  there  are  typically  less 
than  a  dozen  customers. 

As  the  number  of  jobs  of  the  computer  system  grows,  one  can  expect  that 
the  number  of  customer  classes  required  to  represent  them  accurately  will  also 
grow.  One  obvious  distinction  among  the  jobs  in  a  decentralized  system  is  their 
geographic  location.  Users  (jobs)  in  one  location  are  likely  to  use  a  different  set 
of  communications  lines  than  users  in  another  location.  If  one  is  interested  in 
modeling  the  load  on  each  line  accurately,  customer  classes  must  be  used  to 
reflect  these  usage  patterns.  A  large  number  of  classes  can  also  be  required  to 
model  network  flow  control  protocols.  For  instance,  the  most  natural  model  of 
end-to-end  control  requires  a  distinct  customer  class  for  every  source- 
destination  pair  in  the  network.  This  clearly  can  lead  to  a  large  number  of 
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classes. 

All  of  the  applications  mentioned  above  lead  to  queueing  models  with  large 
numbers  of  service  centers,  since,  in  general,  there  will  be  one  service  center 
for  each  physical  component  of  the  computer  system.  Multiple  microprocessor 
systems,  such  as  Cm*  [Swan  et  al.  77],  also  require  models  with  many  service 
centers.  As  it  seems  likely  that  the  trend  towards  these  kinds  of  systems  will 
continue,  it  is  reasonable  to  assume  that  many  models  of  interest  will  contain  a 
large  number  of  centers. 

The  obvious  effect  of  these  trends  on  the  systems  analyst  using  queueing 
models  is  that  he  must  employ  models  with  increasing  numbers  of  all  three 
model  components:  centers,  customers,  and  classes.  Since  the  computing  effort 
required  to  solve  a  queueing  network  model  grows  quite  rapidly  with  the  size  of 
the  model,  it  is  apparent  that  many  of  these  models  will  exceed  both  the  willing¬ 
ness  and  the  capability  of  the  analyst  to  solve  them  exactly. 

As  a  practical  example  of  a  computer  system  that  naturally  gives  rise  to  a 
large  model,  consider  the  teleprocessing  system  used  by  the  Toronto  Stock 
Exchange  (TSE).  The  purpose  of  the  system  is  to  provide  "ticker  tape"  style 
information  about  trading  activity  in  TSE  (and  other  exchanges)  for  brokers  in 
Canada  and  parts  of  the  United  States.  To  do  this,  the  TSE  system  must  con¬ 
stantly  communicate  updated  information  about  recent  transactions  to  each  of 
the  brokerages  connected  to  the  system,  as  well  as  answer  specific  enquiries 
from  the  brokers. 

Each  terminal  connected  to  the  system  is  divided  logically  into  three 
screens.  The  user  has  the  ability  to  request  any  one  of  a  number  of  displays  in 
each  of  these  screens.  Typical  displays  include  the  standard  ticker  tape  of 
trades,  as  well  as  more  restrictive  displays  that  select  only  a  certain  set  of 
stocks  (e.g.,  petroleum  stocks).  The  information  on  the  screens  is  automatically 
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updated  with  any  trading  information  affecting  it.  This  is  done  continuously,  and 
does  not  require  user  intervention.  In  addition  to  these  displays  of  current 
information,  the  user  may  request  information  that  is  more  static,  such  as  the  1 
volume  of  trades  of  a  specific  stock  that  occured  the  previous  trading  day.  A 
typical  terminal  has  one  screen  devoted  to  ticker  tape  information,  and  possibly 
another  to  more  specific  displays  of  current  information.  Specific  enquiries  to 
the  system  are  entered  at  the  rate  of  about  one  per  minute. 


Figure  1.1  -  TSE  Teleprocessing  System 

A  diagram  of  the  key  hardware  components  of  the  system  is  given  in  figure 
1.1.  Information  on  tradings  is  sent  from  the  various  exchanges  along  a  number 
of  lines  to  a  large,  central  computing  facility.  This  central  facility  records  the 
information,  and  then  broadcasts  updated  information  on  the  trade  over  a 
number  of  transmission  lines  that  connect  it  to  the  users  (brokers).  From  one 
to  five  LSI/1  l’s  are  multidropped  on  each  line.  Approximately  ten  terminals  are 
connected  to  each  LSI.  The  LSI/ll’s  act  as  a  terminal  handlers.  They  must 
examine  each  piece  of  update  information  and  decide  whether  or  not  it  pertains 
to  information  being  displayed  on  any  of  the  terminals  that  it  controls.  If  not. 
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the  information  is  discarded.  However,  if  the  update  does  affect  any  of  the  ter¬ 
minals,  the  appropriate  displays  are  changed  to  reflect  the  new  information. 

The  transmission  lines  connecting  the  LSI/ll’s  to  the  central  system  may 
be  either  single  9600  baud  lines  (for  short  distances),  or  dual  4800/2400  lines 
(for  longer  distances).  The  single  lines  are  used  to  transmit  both  broadcast 
information  and  to  respond  to  specific  enquiries.  On  dual  lines,  the  4800  baud 
line  is  used  for  broadcast  information,  and  the  2400  baud  line  is  used  to  respond 
to  enquiries. 

The  purpose  of  modeling  this  system  was  to  investigate  the  answer  to  such 
questions  as: 

1.  What  would  be  the  effect  of  replacing  the  LSI/1  l’s  by  processors 

that  are  N%  faster? 

2.  What  will  be  the  effect  of  increases  in  the  workload  on  each 

transmission  line? 

3.  If  data  being  sent  over  the  transmission  lines  is  first  compressed 

by  the  central  system  (to  increase  the  effective  capacity  of  the 
lines),  will  the  LSI/1  l’s  be  able  to  handle  the  increased  load  of 
expanding  the  compressed  data? 

None  of  these  questions  relate  directly  to  the  capacity  of  the  central  system,  as 
it  was  felt  that  this  component  of  the  network  had  sufficient  power,  and  would 
not  become  the  bottleneck  in  the  near  future. 

A  simple  model  that  might  be  used  to  investigate  questions  like  those  above 
is  shown  in  figure  1.2.  The  model  is  of  a  system  with  ten  transmission  lines.  All 
lines  and  LSI/ll’s  are  represented,  since  they  are  the  focus  of  the  study.  Since 
specific  information  about  the  internals  of  the  central  system  is  not  desired,  the 
model  has  been  simplified  by  representing  the  central  system  as  a  single,  load 
dependent  service  center.  Load  dependent  service  rates  at  the  central  system 
reflect  the  fact  that  the  throughput  of  the  subsystem  increases  with  load,  since 
there  is  increased  opportunity  for  parallelism. 
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The  workload  is  represented  in  the  model  by  forty  customer  classes,  one  for 
each  LSI/11.  This  is  done  because  the  various  LSI/1  l’s  do  not  necessarily  have 
nearly  equal  usage.  Separating  the  load  on  each  LSI  allows  us  to  study  the 
effects  of  the  varying  workloads.  The  number  of  customers  in  each  class 
depends  on  the  number  of  terminals  connected  to  the  LSI/11.  This  is  generally 
about  ten.  The  total  number  of  customers  in  the  model  is  405. 

We  will  show  in  chapter  3  that  the  time  and  space  required  to  solve  a  (load 
independent)  separable  queueing  network  model  is  of  the  order  of  the  product 
of  the  number  of  customers  in  each  class.  For  the  TSE  model  described  above, 
this  is  about  1040.  It  is  therefore  clearly  infeasible  to  obtain  the  exact  solution 
to  this  modeL 

It  is  this  practical  problem  of  obtaining  the  solution  of  large  queueing  net¬ 
work  models  that  is  addressed  in  this  thesis.  We  take  the  viewpoint  that  the  sys¬ 
tem  modeling  task  is  already  sufficiently  difficult  that  the  modeler  should  be 
allowed  to  create  the  natural  model  obtained  by  equating  hardware  resources 
with  service  centers  and  jobs  with  customers,  and  should  not  be  overly  con- 
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cerned  with  the  size  of  the  network  thus  produced.  The  purpose  of  the  work 
presented  here  is  to  obtain  approximate  solutions  for  these  potentially  large 
models  that  are  as  accurate  as  possible  with  limited  computational  effort. 

The  approach  used  is  to  reduce  the  complexity  of  the  models  by  aggregat¬ 
ing  groups  of  model  components  into  single  components.  The  resulting  aggre¬ 
gate  model  is  therefore  smaller,  and  so  is  cheaper  to  solve.  We  develop  aggrega¬ 
tion  procedures  for  each  of  customer  classes,  customers,  and  service  centers. 
Each  aggregation  procedure  cam  be  used  alone  or  in  combination  with  the  oth¬ 
ers. 

The  general  procedure  for  each  of  the  three  types  of  aggregations  is  the 
same.  In  each  case  we  do  the  following: 

1.  Transform  a  large  queueing  network  model  into  a  smaller  model  by 
aggregating  classes,  customers,  or  service  centers.  This  smaller 
model  is  called  the  aggregate  model. 

2.  Obtain  the  exact  solution  of  the  aggregate  model. 

3.  Use  the  performance  measures  for  the  aggregate  model  obtained 
in  step  2  to  compute  approximations  for  the  performance  meas¬ 
ures  of  the  original,  large  network. 

Step  1  and  to  a  lesser  extent  step  3  are  the  keys  to  these  techniques.  Step 
1  is  crucial  since  the  aggregation  procedure  must  be  done  in  such  a  way  that  the 
aggregate  model  behaves  in  a  manner  similar  to  the  original  network  if  Ave  are  to 
get  a  reasonable  approximation.  Step  3  is  important  since  it  involves  the  actual 
computation  of  the  approximate  performance  measures.  The  solution  of  the 
aggregate  network  in  general  gives  only  limited  information  about  the  solution 
of  the  original  network,  so  a  good  choice  for  the  transformation  of  the  aggregate 
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performance  measures  is  needed  if  the  approximation  is  to  be  accurate. 

The  major  contributions  of  this  thesis  are  the  following: 

1.  A  complexity  analysis  of  the  two  major  exact  solution  techniques 
for  the  most  widely  used  class  of  queueing  network  models. 

2.  Aggregation  techniques  for  each  of  customer  classes,  customers 
and  service  centers  that  make  practical  the  approximate  solution 
of  large  models,  including  those  with  load  dependent  service 
centers. 

3.  Experimental  evidence  of  the  error  behavior  of  the  aggregation 
procedures  based  on  a  coherent  set  of  examples. 

4.  A  comparison  of  the  approximation  procedures  and  their  applica¬ 
bility  to  different  types  of  networks. 

The  remainder  of  this  thesis  is  organized  as  follows.  Chapter  2  presents  the 
definitions  of  terms  and  notation  used  throughout  this  thesis,  and  summarizes 
previous  work  in  the  area.  Chapter  3  contains  a  detailed  complexity  analysis  of 
the  two  major  exact  solution  techniques  for  the  class  of  queueing  network 
models  with  which  we  are  concerned.  Chapters  4,  5,  and  6  present  the  class, 
customer,  and  center  aggregation  procedures  respectively.  In  chapter  7  we 
compare  the  approximation  procedures,  and  comment  on  their  relative  merits. 
Chapter  8  is  a  discussion  of  an  alternative  approach  to  the  problem  of  solving 
large  models.  Finally,  Chapter  9  summarizes  the  thesis  and  suggests  possible 


areas  for  future  wrork. 


CHAPTER  2  -  QUEUEING  NETWORK  MODELS  OF  COMPUTER  SYSTEMS 


In  this  thesis  we  are  concerned  with  a  subset  of  the  full  class  of  queueing 
network  models.  Queueing  networks  in  this  subset  are  referred  to  as  separable 
models  (alternatively  product  form  or  local  balance  models,  the  three  terms 
being  interchangeable).  The  important  characteristic  of  separable  queueing 
network  models  that  distinguishes  them  from  other  models  is  that  their  (equili¬ 
brium)  solution  can  be  obtained  relatively  quickly.  For  this  reason,  most  actual 
case  studies  involving  computer  performance  prediction  using  queueing  network 
models  have  used  separable  models.  These  case  studies  have  been  very  success¬ 
ful  generally,  and  it  is  reasonable  to  assume  that  separable  queueing  networks 
are  a  valid  tool  with  which  to  do  performance  analysis. 

2.1  SEPARABLE  QUEUEING  NETWORKS 

Separable  queueing  networks  are  specified  by  a  very  few  parameters.  In 
this  section  we  define  those  parameters,  emd  present  our  notation  for  describing 
them.  We  present  only  that  subset  of  separable  networks  that  has  proven  gen¬ 
erally  useful  in  systems  modeling.  The  reader  is  referred  to  [Chandy  et  al.  77] 
for  a  more  complete  discussion  of  essential  properties  shared  by  all  separable 
networks.  Our  notation  is  summarized  in  appendix  A. 

A  separable  network  consists  of  two  entities:  service  centers  and  custo¬ 
mers.  Customers  of  the  queueing  network  generally  represent  jobs  of  a  com¬ 
puter  system,  while  service  centers  generally  represent  hardware  resources 
that  deliver  service,  such  as  a  CPU.  Customers  in  the  queueing  network  travel 
from  service  center  to  service  center  obtaining  service.  A  customer  is  delayed 
at  a  center  for  both  the  time  required  for  it  to  receive  the  desired  service,  and 
for  the  amount  of  time  spent  in  queue  at  the  center  due  to  congestion  by  other 
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customers.  The  mean  amount  of  service  that  a  customer  requires  at  each 
center  is  specified  by  that  customer’s  service  time  vector.  S'  =  )• 

,\ 

Here  M  is  the  number  of  service  centers  in  the  network,  and  is  the  non¬ 
negative  mean  service  requirement  of  the  customer  at  center  m  for  each  visit 
there.  The  number  of  visits  the  customer  makes  to  each  center  is  specified  by 

its  relative  throughput  vector,  e  =  The  ratio  — —  is  the  number  of 

em’ 

visits  to  center  m  that  the  customer  makes  for  each  visit  to  center  m\  In  this 
sense  e  gives  only  relative  values,  and  so  multiplying  e  by  a  (positive)  scalar  does 

not  affect  the  network.  The  product  of  5  and  e  is  called  the  relative  utilization 

vector  for  the  customer,  and  is  denoted  A’’.  By  definition 

x  =  (Xi,Xz . Xu) 


The  ratio 


m 


is  the  total  amount  of  service  the  customer  requires  from  center 


m 


m  relative  to  fits  requirement  at  center  m'.  Once  again,  the  vector  gives  only 
relative  values,  so  that  multiplying  X  by  a  (positive)  scalar  does  not  affect  the 
network. 

All  customers  in  the  model  with  relative  utilization  vectors  that  are  scalar 
multiples  of  each  other  belong  to  the  same  customer  class.  These  customers 
have  the  property  that  they  are  statistically  identical,  that  is,  they  are  indistin¬ 
guishable  in  the  network.  We  denote  the  number  of  classes  in  the  network  by/?. 
If  the  number  of  customers  in  some  class  r  does  not  vary  with  time,  the  class  is 
closed.  The  number  of  customers  in  closed  class  r  is  denoted  Nr  and  is  called 
the  mijltiprogramming  level  of  class  r.  If  a  class  is  not  closed,  the  number  of 
customers  in  it  is  allowed  to  vary  by  postulating  an  infinite  source  from  which 
customers  may  arrive  to  the  network,  and  an  infinite  sink  into  which  they 
depart.  There  may  therefore  be  any  non-negative  number  of  customers  of  that 
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class  in  the  network  at  any  point  in  time.  Such  classes  are  called  open  classes. 

Service  centers  deliver  service  at  rates  that  may  depend  upon  the  custo¬ 
mer  population  at  the  center.  In  the  simplest  case  the  center  delivers  service  at 
rate  (1  unit  of  service)/(time  unit)  regardless  of  the  population.  Such  centers 
are  said  to  be  load  independent.  If  a  service  center  is  not  load  independent  it  is 
called  load  dependent.  Load  dependent  service  centers  can  have  service  rates 
that  depend  on  the  customer  population  in  either  of  two  (principal)  ways: 

1.  The  service  rate  depends  only  on  the  total  number  of  customers  at 

the  center. 

2.  The  service  rate  for  class  r  depends  only  on  the  number  of  class  r 

customers  at  the  center. 

We  denote  the  service  rates  at  center  m  in  the  first  case  by  a™(nm),  where  71™  is 
the  number  of  customers  at  the  center.  By  definition,  0.^(1)  =  1.  Service  rates 
at  center  m  for  class  r  for  the  second  type  of  load  dependency  are  denoted 
Or.mOv.m).  where  rv.m  is  the  number  of  class  r  customers  at  center  m.  By 
definition,  0,^(1)  =  1.  k 

Four  basic  scheduling  disciplines  are  allowed  at  the  service  centers: 

1.  First-Come-First-Served  (FCFS):  All  classes  must  have  the  same 

mean  service  time  at  the  center.  Only  type  1  service  rates  are 
allowed  (i.e.,  they  may  depend  only  on  the  total  number  of  cus¬ 
tomers  at  the  center). 

2.  Processor  Sharing  (PS):  This  is  the  limiting  case  of  round  robin  as 

the  quantum  size  goes  to  zero. 


3.  No  Queueing  (NQ):  The  service  center  can  serve  an  unbounded 
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number  of  customers  in  parallel,  so  no  queueing  can  take  place 
there. 

4.  Last-Come-First-Served-Preemptive-Resume  (LCFSPR):  A  customer 
arriving  to  the  service  center  is  placed  in  service  immediately, 
preempting  the  customer  in  service  if  there  is  one.  Preempted 
jobs  are  placed  on  a  stack  and  continued  on  a  LCFS  basis. 

A  product  form  network  is  defined  by  specifying  values  for  the  parameters 
described  above. 

Once  the  network  has  been  defined,  it  may  be  solved.  By  solving  a  queueing 
network  model  we  usually  mean  computing  the  equilibrium,  or  long  run,  proba¬ 
bility  of  observing  the  network  in  each  of  its  possible  states  (this  will  be 
explained  further  in  chapter  3).  This  information  is  then  used  to  compute  high 
level  performance  measures.  Among  the  measures  commonly  computed  are: 

1.  Throughput  -  The  number  of  customers  per  time  unit  that  pass 

through  some  point  in  the  network. 

2.  Utilization  -  The  percentage  of  time  that  a  service  center  is  busy. 

3.  Queue  Length  Distribution  -  The  equilibrium  probability  associated 

with  each  possible  queue  length  at  a  service  center. 

4.  Mean  Queue  Length  at  a  Service  Center. 

5.  Mean  Response  Time. 

Each  of  the  above  measures  can  be  computed  for  the  entire  population,  or  can 
be  computed  for  each  class  in  the  network.  For  instance,  one  might  be 
interested  in  the  total  utilization  of  some  service  center,  as  well  as  the  break- 
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down  of  the  utilization  among  all  classes  cf  customers  using  the  center. 

As  mentioned  earlier,  the  significant  characteristic  of  separable  networks 
as  contrasted  with  other  queueing  network  models  is  that  their  solution  can  be 
obtained  relatively  quickly.  In  chapter  3  we  introduce  the  two  major  solution 
techniques  for  separable  queueing  network  models  and  show  that  they  require 
time  and  space  (for  load  independent  networks)  on  the  order  of 

P.M  n  (A'r  +  l) 

r  =  1 

where  R  is  the  number  of  customer  classes  in  the  network,  M  the  number  of  ser¬ 
vice  centers,  and  Nr  the  number  of  customers  in  class  r.  This  order  of  complex¬ 
ity  shows  that  even  for  networks  with  a  dozen  classes  with  ten  customers  each,  it 
is  unreasonable  to  expect  to  compute  the  exact  solution.  Since  models  of  this 
size  are  quite  likely  to  occur  when  studying  distributed  systems,  there  is  a  need 
for  cheap,  approximate  solution  techniques. 

Figure  2.1  -  Example  Model 


DISK-1 

DISK-2 

DISK-3 

DISK-4 


Na  =  4 

2A  —  1  0  .35  .35  .3 
SA  =  .111  0  1  .667  .5 
XA  =  .111  0 .25 .233  .15 

UA  -  .256  0  .80S  .537  .345 
Ta  =  2.30  0  .81  .81  .69 
Qa  -  .731  0  1.825  .913  .481 

Before  briefly  summarizing  related  w 


Nb  =  2 
es  =  1100  0 
SB  =  1  .667  0  0  0 
XB  —  1  .667  0  0  0 

UB  =  .641  .428  0  0  0 
Tb  =  .64  .64  0  0  0 
Qb  =  1.431  .569  0  0  0 

k  on  the  solution  of  large  networks, 


we  first  give  an  example  of  a  queueing  network  model  for  illustrative  purposes. 


t 
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(This  example  is  taken  from  [Baskett  et  al.  75].)  The  model  is  shown  in  figure  2.1, 
and  its  parameter  values  are  listed.  The  queueing  network  represents  a  com¬ 
puter  system  with  a  CPU  and  four  disk  drives.  As  shown  in  the  diagram  of  figure 
2.1,  the  model  has  a  service  center  corresponding  to  each  hardware  device  of 
the  physical  system.  These  centers  are  labeled  CPU  and  D1SK_1  through 
DISK_4.  Measurements  of  the  computer  system  have  shown  that  there  are  two 
types  of  jobs  in  the  system.  The  first  type,  represented  in  the  model  by  class  B, 
uses  only  the  CPU  and  DISK_1.  The  second  type,  represented  by  class  A,  uses 
the  CPU  and  disks  2  through  4.  These  usage  patterns  are  reflected  in  the  rela¬ 
tive  throughput  vectors  eA  and  e#.  Vectors  SA  and  Sg  give  the  mean  service 
requirement  for  each  visit  to  the  service  centers  for  each  customer  class. 
Finally,  NA  and  N&  give  the  multiprogramming  levels  (or  number  of  customers) 
of  classes  A  and  B  respectively.  We  have  assumed  that  all  service  centers  are 
load  independent,  and  that  all  disks  have  FCFS  scheduling,  while  the  CPU  has  PS 
scheduling. 

From  the  above  parameters  we  can  calculate  the  relative  utilization  vectors 
XA  and  A#,  and  solve  the  model.  The  results  of  this  solution  are  listed  at  the  bot¬ 
tom  of  figure  2.1. 

In  a  typical  modeling  study,  the  parameter  settings  for  the  model  are 
derived  from  measurement  information  of  the  actual  system.  In  those  cases, 
one  has  the  opportunity  to  validate  the  output  of  the  model  against  the  meas¬ 
ured  values  for  the  same  statistics  (usually  utilizations  and  mean  queue 
lengths).  If  the  two  do  not  match,  there  is  probably  some  error  with  the  collec¬ 
tion  or  interpretation  of  the  measurement  data,  the  derivation  of  the  parameter 
settings  from  the  measurement  data,  or  the  simplifying  assumptions  made  in 
creating  the  model.  These  errors  must  be  investigated  and  the  model  modified 
until  it  agrees  with  the  measurements. 
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Table  2.2  -  Faster  CPU 


Modifications: 

Na  =4 

eA  =  (1  0  .35  .35  .3) 

SA  -  .055  0  1  .66?  .5 
XA  =  .055  0  .35  .233  .15 

Solution: 

UA  =  .136  0  .859  .573  .368 
Ta  =  2.45  0  .86  .86  .74 
Qa  =  .290  0  2.144  1.035  .531 

Once  the  model  is  validated,  it  can 


NB  =  2 

eB  =  (1  1  0  0  0) 
SB  =  .5  .687  0  0  0 
XB  =  .5  .667  0  0  0 


UB  =  .535  .713  0  0  0 
Tb  =  1.07  1.07  0  0  0 
Qb  =  .904  1.096  0  0  0 

used  for  performance  prediction. 


Typical  applications  of  such  models  include  exploring  the  effect  of  upgrading 


some  hardware  component,  of  redistributing  files  among  the  disks,  and  of  vary¬ 
ing  the  multiprogramming  levels.  In  table  2.2  we  show  the  results  of  the  model 
under  the  assumption  that  the  CPU  was  upgraded  to  one  twice  as  fast.  This 
change  is  reflected  in  the  model  by  halving  the  CPU  entries  in  the  service  time 


vectors  SA  and  SB,  and  re-solving  the  model.  Table  2.3  shows  the  effects  of 
redistributing  the  files  for  class  B  so  that  D1SK_2,  DISK_3,  and  DISK_4  are  util¬ 


ized  equally. 


Table  2.3  -  Balanced  Disk  Usage 

Modifications: 

Na  =4 

^  =  1  0  .222  .333  .444 
SA  =.111  0  1  .687  .5 
XA  =  .111  0  .222  .222  .222 


Nb=2 

eB  =  1  1  0  0  0 

SB  =  1  .667  0  0  0 


Solution: 

U*  =  .292  0  .584  .584  .584 
Ta  =  2.63  0  .58  .88  1.17 
Qa  =  .924  0  1.025  1.025  1.025 


UB  =  .618  .412  0  0  0 
Tb  -  .818  .618  0  0  0 
Qb  =  1.457  .543  0  0  0 


This  change  is  reflected  in  the  model  by  changing  the  relative  throughput  vector 


eB  and  re-solving  the  model.  Finally,  table  2.4  shows  the  effect  of  Increasing  the 
multiprog amming  levels  under  the  assumption  that  a  correspondingly  larger 
amount  of  main  memory  had  been  obtained.  This  assumption  is  required  so  that 
memory  contention  is  identical  in  both  the  old  and  new  configurations,  and  no 
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Table  2.4  -  Increased  Multiprogramming  Levels 
Modifications: 

Na  =  & 

eA  =  1  0  .35  .35  .3 
SA  =  .111  0  1  .667  .5 
XA  =  .111  0  .25  .233  .15 

Solution: 

UA  =  .269  0  .847  .565  .363  UB  =  .687  .458  0  0  0 
Ta  -  2.42  0  .85  .85  .73  TB  =  .687  .687  0  0  0 

Qa  =  1.15  0  2.28  1.05  .53  QB  =  2.29  .71  0  0  0 

additional  memory  management  overhead  is  required  in  the  model.  Appropriate 
model  parameter  settings  are  obtained  simply  by  altering  the  settings  for  NA 
and  Nb  . 

The  example  discussed  above  is  an  indication  of  the  simplest  kinds  of 
queueing  network  models  that  have  been  successfully  applied  in  practice  (see 
[Kienzle  and  Sevcik  79]  for  a  survey).  In  general,  the  predictions  of  the  models 
have  been  accurate  to  about  5%  for  utilizations  and  10%  mean  queue  lengths. 
Considering  the  complexity  of  the  systems  being  modeled,  these  results  are 
quite  remarkable. 


Nb  =  3 
eB  =  1  1  0  0  0 
SB  =  1  .667  0  0  0 
XB  =  1  .667  0  0  0 


2.2  COMPUTATIONAL  METHODS 

In  this  section  we  briefly  describe  the  history  of  the  efficient  computational 
procedures  for  the  solution  of  closed,  separable  queueing  network  models. 
These  procedures  are  subsequently  treated  in  more  depth  in  chapter  3. 

Buzen  [71,73]  developed  the  basis  of  the  convolution  class  of  solution  tech¬ 
niques  as  methods  for  solving  single  class  models.  There  are  two  variations  of 
the  technique,  one  for  load  independent  service  centers,  and  one  for  load  depen¬ 
dent  service  centers.  The  load  dependent  algorithm  involves  a  vector  convolu¬ 
tion  for  each  service  center,  and  therefore  requires  time  proportional  to  MN?\ 
For  load  independent  service  centers  some  simplifications  are  possible  that 
reduce  the  complexity  to  0(MN).  Both  algorithms  require  space  0(N). 
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The  result  of  Buzen's  algorithm  is  a  vector  from  which  performance  meas¬ 
ures  of  the  system  with  customer  populations  1,  2 can  be  derived. 
Throughputs  (and  therefore  utilizations  for  load  independent  centers)  can  be 
computed  from  this  vector  in  two  operations  for  each  customer  population  of 
interest.  Mean  queue  lengths  and  queue  length  distributions  can  be  calculated 
also  in  time  0(N). 


Kobayashi  [76 j  proposed  a  modification  of  Buzen’s  algorithm  based  on 
Polya’s  theorem  of  enumeration.  The  algorithm  applies  only  to  single  class,  load 
independent  networks.  This  technique  requires  the  same  space  as  Buzen’s  algo¬ 


rithm,  but  time  Q(N z)  instead  of  0(MN).  It  is  therefore  useful  for  networks 

where  the  number  of  service  centers  exceeds  the  number  of  customers. 

A  number  of  authors  have  suggested  extensions  to  Buzen’s  algorithm. 
Chandy  et  al.  [75]  and  Reiser  and  Kobayashi  [76]  modified  the  technique  to 
include  multiple  class  networks.  These  techniques  replace  vector  convolutions 
with  matrix  convolutions  (for  load  dependent  service  centers)  and  require  time 


„  „A  (Nr+Z)(Nr+l) 

°Fn — — s — 

r= 1  c 


and  space  0 


r=* 


The  result  of  these  algorithms  is 


a  matrix  from  which  performance  measures  for  each  class  can  be  obtained  in  a 


manner  analogous  to  those  used  by  Buzen  for  single  class  networks. 


Reiser  and  Lavenberg  [78,80]  developed  a  different  type  of  solution  tech¬ 
nique  for  multi-class  networks  with  restricted  types  of  load  dependent  service 
centers.  Their  technique,  called  Mean  Value  Analysis  (MVA),  is  based  on  Little’s 
theorem  [Little  61],  which  relates  throughput  to  mean  residence  time  and  mean 
queue  length.  MVA  has  nearly  the  same  time  and  space  requirements  as  the 
convolution  techniques  for  load  independent  networks,  but  is  more  expensive  for 
load  dependent  networks.  Its  principal  advantages  are  that  it  eliminates  some 
numerical  problems  that  convolution  can  experience  because  of  the  large  range 
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of  values  required  at  times  using  that  technique,  and  that  it  is  more  easily 
understood  and  implemented  since  it  is  based  on  a  simple  recurrence  relation¬ 
ship  involving  Little’s  theorem.  However,  MVA  can  be  unstable  for  load  depen¬ 
dent  networks  because  of  numerical  significance  problems. 

Chandy  and  Sauer  [80]  proposed  a  solution  technique  that  is  a  hybrid 
between  convolution  and  MVA  techniques.  In  this  way  some  of  the  numerical 
problems  of  each  are  minimized.  Chandy  and  Sauer  [80]  also  present  an  algo¬ 
rithm  for  load  independent  networks  that  trades  time  for  space  in  situations 
where  space  is  critical  (such  as  on  programmable  calculators).  However,  this 
technique  greatly  increases  the  time  required  to  solve  the  network,  and  so  is 
most  likely  not  a  suitable  choice  for  general  purpose  solution  routines.  To  date 
there  is  no  single  technique  that  can  be  used  successfully  for  every  kind  of 
separable  queueing  network. 

2.3  PREVIOUS  WORK 

In  this  section  we  briefly  summarize  related  work  on  the  approximate  solu¬ 
tion  of  large  queueing  network  models.  We  begin  with  the  work  of  Chandy  et  al. 
[75].  This  paper  does  not  present  an  approximate  solution  technique,  nor  does 
it  deal  with  the  solution  of  large  queueing  network  models.  Nonetheless,  it  is 
highly  relevant  to  our  work  since  it  is  concerned  with  the  aggregation  of  service 
centers  in  separable  queueing  network  models. 

Chandy  et  al.  [75]  describes  a  technique  for  replacing  a  group  of  service 
centers  in  a  queueing  network  model  by  a  single  "flow  equivalent"  service 
center.  In  our  terminology,  many  service  centers  are  aggregated  into  one  ser¬ 
vice  center.  The  aggregation  is  performed  by  isolating  the  set  of  service  centers 
to  be  aggregated  from  the  rest  of  the  network.  This  subnetwork  is  solved  in  iso¬ 
lation  for  all  possible  elastomer  populations  that  might  reside  in  that  portion  of 
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the  full  network.  The  throughputs  obtained  from  these  solutions  are  then  used 
to  compute  the  rate  at  which  customers  leave  the  subset  to  the  rest  of  the  net¬ 
work  conditioned  on  the  customer  population  in  the  subset.  A  "flow  equivalent" 
service  center  is  then  formed  with  load  dependent  service  rates  corresponding 
to  the'  population  dependent  departure  rates  of  the  subnetwork.  Finally,  this 
single,  equivalent  service  center  is  inserted  in  the  network  in  place  of  the  sub¬ 
network,  and  the  resulting  aggregate  network  is  solved.  An  illustration  of  this 


Figure  2.2  -  Decomposition  Technique 

Chandy  et  al.  showed  that  this  aggregation  is  exact  in  separable  networks; 
that  is,  the  performance  measures  for  those  service  centers  not  in  the  subnet¬ 
work  are  identical  in  the  aggregate  solution  and  the  solution  of  the  original  net¬ 
work.  Denning  [75]  and  Sauer  [79]  showed  further  that  it  is  also  easy  to  com¬ 
pute  exactly  the  performance  measures  of  the  service  centers  in  the  subnet¬ 
work  from  those  of  the  flow  equivalent  service  center.  However,  the  technique 
does  not  reduce  the  effort  required  to  solve  a  queueing  model.  Instead,  it 
divides  the  work  required  into  two  separate  steps.  Some  computational  savings 
might  be  realized  if  one  is  intending  to  solve  a  number  of  related  networks  such 
that  there  is  a  subnetwork  common  to  all  the  networks.  In  that  case,  the  sub¬ 
network  can  be  solved  once  and  replaced  by  a  flow  equivalent  server.  The  work 
required  to  solve  the  subnetwork  therefore  need  not  be  duplicated  for  the  solu- 


tion  of  each  variation,  and  the  total  computational  effort  is  reduced.  However, 
the  technique  as  presented  is  of  no  benefit  in  situations  where,  for  instance,  the 
standard  solution  techniques  require  excessive  storage  because  of  the  size  of 
the  network.  In  chapter  6  we  examine  an  approximate  aggregation  technique 
that  can  be  applied  in  those  cases. 

Reiser  and  Lavenberg  [78,80a]  and  Reiser  [79a]  have  developed  an  approxi¬ 
mate  solution  technique  that  is  intended  to  reduce  the  effort  required  to  solve 
networks  with  large  numbers  of  classes.  Their  technique  is  based  on  an  itera¬ 
tion,  the  key  to  which  is  an  estimate  of  the  change  in  the  mean  queue  lengths  of 
class  r  at  all  service  centers  when  the  number  of  class  r  customers  is  varied 
from  Nr—1  to  Nr.  This  estimate  is  obtained  by  solving  R  single  class  models  with 
only  the  class  r  population  in  model  r.  The  influence  of  customers  in  the  other 
classes  is  reflected  in  these  single  class  models  by  inflating  the  service  time  at 
each  service  center  in  proportion  to  the  utilization  of  that  center  by  all  classes 
except  class  r.  The  difference  in  the  mean  queue  lengths  of  the  two  models  is 
then  used  as  the  required  estimate. 

As  pointed  out  in  chapter  8,  this  technique  can  sometimes  give  reasonable 
results.  However,  the  isolation  of  the  classes  in  distinct  models  can  often  lead  to 
significant  error. 

Bard  [79]  also  presents  an  iterative  solution  technique.  His  procedure  is 
intended  for  networks  with  very  large  customer  populations  (he  calls  it  the 
"infinite  customer"  approximation).  The  technique  is  an  iteration  based  on 
mean  queue  lengths  at  the  service  centers.  Bard’s  estimate  is  that,  in  the  limit 
as  the  customer  population  grows,  all  customers  are  in  the  queue  at  the 
bottleneck  service  center.  This  fact  is  used  for  the  basis  of  an  iteration  to 
obtain  an  approximate  solution  for  finite  populations. 

The  approximation  published  in  Bard  [79]  gives  results  that  are  often  very 
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poor.  An  as  yet  unpublished  refinement  of  his  technique  does  work  remarkably 
well,  even  for  very  small  customer  populations.  However,  this  refinement  iso¬ 
lates  customer  classes  in  much  the  same  way  that  Reiser’s  technique  does,  and 
so  is  subject  to  the  same  types  of  inaccuracies.  These  approximations,  as  well 
as  Reiser’s,  are  considered  in  more  detail  in  chapter  8.  We  show  there  that  the 
isolation  of  the  classes  can  lead  to  significant  error,  and  present  a  new 
refinement  of  these  techniques  that  overcomes  this  drawback. 


CHAPTER  3  -  EXACT  SOLUTION  TECHNIQUES  FOR 


SEPARABLE  QUEUEING  NETWORKS 


In  this  chapter  we  examine  the  space  and  time  requirements  of  the  two 
principal  exact  solution  techniques  for  separable  queueing  network  models,  the 
convolution  technique  and  mean  value  analysis  (MVA).  Complexity  analyses  are 
presented  for  the  amount  of  work  required  to  go  from  network  description  to 
final  performance  measures.  As  only  the  complexity  measures  that  are  the 
results  of  this  chapter  are  important  to  subsequent  chapters,  the  reader 
interested  primarily  in  the  approximation  methods  is  encouraged  to  skip  over 
the  detailed  analysis  given  here  and  proceed  to  chapter  4. 

The  characteristic  of  separable  networks  that  makes  them  amenable  to 
efficient,  exact  solution  is  that  their  steady  state  probabilities  have  product 
form.  Let  there  be  R  classes  in  the  network  (each  a  separate  closed  routing 
chain)  and  M  centers.  Let  the  multiprogramming  level  of  class  r  be  Nr.  Let  erm 
be  the  relative  throughput  of  class  r  customers  at  center  m  (that  is,  the  ratio 

-  r,m  is  equal  to  the  ratio  of  the  throughputs  of  class  r  at  centers  m  and  j).  Let 

er,j 

the  service  time  for  class  r  at  center  m  be  SKm.(n).  We  allow  SV.m  to  be  a  func¬ 
tion  of  either  the  total  number  of  customers  at  center  m  or  the  number  of  class 
r  customers  there.  Center  m  is  load  dependent  if,  for  some  m,n. 

Otherwise,  it  is  load  independent.  The  steady  state  probabil¬ 
ity  of  state  (where  each  nm  is  a  vector  describing  the  number  of 

customers  of  each  class  resident  at  center  m)  is  given  by 

P{nh,..,nM)  =  {I/O)  *  n  Pmi'Thn) 

m  =  1 

Here  F^n^)  is  a  function  that  depends  on  the  sendee  discipline  of  center  m, 
and  G  is  a  normalizing  constant  chosen  so  that  the  sum  of  the  probabilities  of  all 
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states  is  one. 

The  first  efficient  solution  technique  for  networks  of  this  type  was  developed 
by  Buzen  [71,73],  and  involved  vector  convolutions  for  the  computation  of  the 
normalizing  constant.  His  method  applied  only  to  single  class  models.  Buzen’ s 
basic  technique  was  extended  by  Chandy  et  al.  [75]  and  Reiser  and  Kobayashi 
[76]  to  include  multiple  class  networks  by  the  use  of  matrix  convolutions  in 
place  of  vector  convolutions.  In  both  the  single  and  multiple  class  cases, 
simplified  algorithms  not  requiring  convolutions  are  possible  for  load  indepen¬ 
dent  centers. 

Reiser  and  Lavenberg  [78]  developed  a  different  solution  technique  that 
avoids  the  problem  of  computing  the  normalizing  constant.  Their  solution  is 
based  on  applications  of  Little's  equation  [Little  61]  both  to  the  network  as  a 
whole  and  to  individual  service  centers.  The  solution  is  obtained  by  computing 
the  performance  measures  for  class  r  based  on  the  solution  of  the  same  network 
with  a  customer  population  having  one  less  class  r  customer. 

Both  the  convolution  algorithm  and  MVA  can  suffer  from  numerical 
difficulties.  The  convolution  algorithm  experiences  problems  with  floating  point 
range,  causing  either  underflow  or  overflow  with  consequent  loss  of  accuracy. 
Mean  value  analysis  avoids  this  problem,  but  can  prove  numerically  unstable  for 
networks  with  load  dependent  service  centers.  A  number  of  modifications  have 
been  proposed  to  the  basic  algorithms  that  address  these  problems  ([Chandy 
and  Sauer  80],  [Reiser  80]).  These  methods  are  basically  hybrids  of  the  convolu¬ 
tion  and  MVA  approaches,  and  so  have  time  and  space  requirements  like  them 
(one  technique  stores  scaled  normalizing  constants  in  a  convolution-like 
manner,  and  the  other  unnormalized  queue  lengths  in  an  MVA-like  manner).  We 
therefore  do  not  treat  them  explicity  here*. 

*Chandy'and  Sauer[0O]  also  present  a  new  algorithm  that  is  designed  to  sacrifice  time  for  a 
large  space  savings.  As  the  technique  is  intended  only  for  processors  with  extremely  limited 
storage  (such  as  hand  calculators),  we  do  not  treat  it  here. 
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The  remainder  of  this  chapter  deals  with  the  space  and  time  requirements 
of  these  two  solution  methods.  We  assume  that  the  values  of  the  relative  utiliza¬ 
tions,  XT'm (n. ) ,  defined  by 

~  er,m  *  Srm(7l) 

are  known  for  both  solution  methods.  It  is  almost  always  possible  to  obtain 
these  values  either  directly  or  with  a  few  simple  calculations  from  measurement 
data.  Computing  the  values  from  branching  probabilities  (the  traditional  tech¬ 
nique)  requires  an  eigenvector  solution  of  the  branching  probability  matrix,  an 
0({RM)2)  operation  in  general.  (Faster  solutions  are  possible  for  special  net¬ 
work  topologies,  e.g.,  central  server  models  [Buzen  71]).  However,  as  the  time 
and  space  required  to  obtain  the  network  description  is  identical  for  both  solu¬ 
tion  methods,  it  is  irrelevant  to  a  comparison  of  the  two  and  is  therefore  ignored 
in  the  rest  of  this  chapter. 

Anatyses  of  both  MVA  and  the  convolution  technique  divide  naturally  into 
load  independent  and  dependent  cases.  For  simplicity,  we  consider  only  the 
basic  algorithms  for  networks  that  are  entirely  load  dependent  or  load  indepen¬ 
dent.  Numerous  modifications  to  reduce  the  computation  in  special  cases,  for 
example  for  service  centers  with  multiple  servers,  have  been  proposed  in 
([Reiser  and  Kobayashi  75],  [Kobayashi  76],  [Reiser  and  Sauer  78],  [Reiser  and 
Lavenberg  78],  [Bruell  78]).  However,  because  these  would  unnecessarily  com¬ 
plicate  the  analyses,  they  are  not  considered  here. 

3.1  CONVOLUTION  TECHNIQUE 

The  convolution  solution  technique  requires  that  a  series  of  matrices, 
be  computed.  Each  matrix  is  R  dimensional  with  indices  running 
from  0  to  Nr  (the  multiprogramming  level  of  class  r)  in  the  rth  dimension.  An 
element  Cm(7i1)n2,...,n/?)  is  the  relative  (i.e.,  unnormalized)  probability  that 


-  27  - 


customers  of  class  1,  n2  customers  of  class  2,...,  nR  of  class  R  reside  at  the  first 
m  service  centers.  The  purpose  of  the  computation  of  the  Cm  is  to  derive  the 
normalizing  constant  for  the  network.  The  normalizing  constant  is  given  by 
Gm(N since  this  is  the  sum  of  the  relative  probabilities  of  all  the  states. 

The  computation  of  the  Gm  can  be  done  in  two  distinct  ways,  depending  on 
whether  center  m  is  load  dependent  or  load  independent.  Similarly,  the  compu¬ 
tation  of  the  performance  measures  for  center  m  depends  on  whether  or  not  it 
is  load  dependent.  We  therefore  divide  the  discussion  into  these  two  cases, 
beginning  with  the  load  independent  case. 

Load  Independent:  Time 

For  load  independent  centers  m,  the  computation  of  the  Cm  from  Gm-i  is 

# 

straightforward: 

R 

Gm(nlt...,nR )  =  Cm_i(ni,...,7ii?)  +  2  Grn{nh...,nr-1 . nR)  ♦  Xr,m  (3.1) 

r- 1 

Each  element  of  Gm  requires  R  multiplications  and  R  additions.  The  total  work 

R 

required  to  compute  each  Gm  is  therefore  R  T~j  (Nr+ 1)  multiplications  and  an 

r=  1 

equal  number  of  additions. 

Having  computed  Gy,  it  is  possible  to  compute  efficiently  a  number  of  per¬ 
formance  measures  of  interest.  We  consider  here  the  computation  of 
throughput,  utilization,  and  mean  queue  length  by  class/center  pair.  A  sum¬ 
mary  of  results  is  given  in  table  3.1. 

The  throughput  of  class  r  customers  at  center  m,  Tr<m,  is  given  by 

„  _  Gut.Ni . Nr- 1 . NR) 

rm  er'm  Gu{Nl . Nr) 


Since  we  need  to  compute  the  quotient  of  the  two  elements  of  Gm  only  once  per 
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Table  3.1  -  Convolution  Load  Independent  Time 


EACH _ __  ALL _ _ _ WjjrC U.,J, 


RP 

RP  + 

RMP  * 

RMP  4- 

RM  P+  * 

RM  P+  + 

Tput  1  * 

1 . / 

Cif+ 

RM  * 

R  / 

RM(L  + 1)  * 

R(L  +  1)  / 

Util.  CM  +  Tput 

1  • 

CM  +  Tput 

RM  * 

Gu  +  Tput 

RM(L  + 1)  * 

CAf  + 

Mean  l+JVrF~  * 

Qlen  1  / 

NrP~  + 

Gm  + 

R(M-l){P  +  l)  * 

R{M- 1)  / 

R(M-l)P+R(M-2)  + 

R 

+ 

R  (M—1)(L  +1)  / 

R{M-1)P++R(M-2)(L  +  1)  + 

R(L  + 1) 

TOTAL 

(2RM-R){P  +  1)+RM  * 
RM  / 

(2RM-R)P+R(M-2)  + 

R 

R(2M-l)P++RM(L+l)  * 

R{M-i){L+l) 

R(2M-1)P++R(M-2)(L+1)  + 

R(L  + 1) 

5RM-2R  A-{ARM-2R  )P  Ops 

(L  +  1)[5RM -2R  +(4RM -2R)P^  Ops 

p  =  n(wr+i) 

r=  1 


p+  =  (t+D  n  (Af,+D 


class,  the  total  computation  requires  R  divisions  and  RM  multiplications  to  com¬ 
pute  all  the  throughputs. 

The  utilization  of  center  m  by  class  r,  Urt7n,  is  given  by 


Ur,m  ~  X? 


r,m 


Cjr^l . NB)- 


—  c1  *  T 

-  °r,m  1  r,m 

Again,  a  total  of  R  divisions  and  RM  multiplications  are  required  to  compute  all 
the  utilizations  from  G u,  or  RM  multiplications  in  total  if  the  throughputs  are 
known. 


Mean  queue  length  at  load  independent  center  m  for  class  r,  Qrm,  can  be 


Qr%m  —  %r. 


m 


GjV+m(Ni,  ...,Nr  1,...,Nr) 


computed  by 
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where  G^+m  is  computed  from  G m  according  to  (3. 1)  using  the  Xr  m  for  center  m. 
If  the  queue  lengths  at  all  centers  are  to  be  computed,  time  can  be  saved  by 
computing  one  of  them  (for  each  class)  by 

Qr.m  =  Xr  —  l_j  Qr>m  (3-2) 

m  =  i 

This  requires  M  —  2  additions  and  one  subtraction  per  class.  The  total  processing 

R 

requirement  is  therefore  R  (M  —  l)  {][  (Nr  + 1)  multiplications  and  the  same 

r=  1 

number  of  additions  to  compute  the  Cjy+rn,  and  R(M- 1)  additional  multiplica¬ 
tions  and  divisions,  R(M- 2)  additions,  and  R  subtractions  to  compute  the  Q-—. 

We  note  here  that  if  one  is  interested  in  the  results  of  the  model  for  more 
than  one  population,  it  is  possible  to  save  computation  by  computing  the  Gm 
such  that  the  index  of  the  Gm  runs  from  0  to  the  maximum  multiprogramming 
level  over  all  the  populations  to  be  considered.  In  this  way  all  those  elements  of 
Gm  that  have  to  be  computed  for  each  population  are  computed  only  once.  Per¬ 
formance  measures  for  each  population  can  be  obtained  from  submatrices  of 
the  Gm  computed  in  this  fashion. 

An  example  of  a  case  in  which  it  is  desirable  to  compute  performance  meas¬ 
ures  for  many  populations  in  parallel  is  examining  the  effect  of  altering  the 
number  of  customers  in  a  particular  customer  class.  In  these  cases,  Nr  might 
vary  from  0  to  some  maximum  L.  The  work  required  to  compute  the  Cm  is 
equivalent  to  computing  the  Gm  for  a  customer  population  of 

The  work  to  compute  the  throughputs  and  utilizations 
for  all  the  populations  is  just  (L  + 1)  times  the  work  for  a  single  population,  as  no 
savings  is  possible  by  computing  them  together.  However,  it  is  possible  to 
reduce  the  total  cost  of  computing  the  mean  queue  lengths  by  computing  G\f+m 
only  once  for  each  center,  and  using  it  to  compute  the  mean  queue  length  for  all 
classes  at  that  center  for  all  the  populations.  These  results  are  tabulated  in 
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table  3.1. 


Load  Dependent:  Time 


In  this  section  we  consider  the  time  requirements  for  networks  containing 
load  dependent  service  centers.  Our  results  are  summarized  in  table  3.2.  Our 
analysis  is  for  the  solution  of  a  network  composed  entirely  of  load  dependent 
centers.  Since  it  is  possible  to  combine  the  load  dependent  and  load  indepen¬ 
dent  algorithms  for  a  single  network,  slight  modifications  must  be  made  for  net¬ 
works  with  both  types  of  centers.  For  these  networks  some  work  can  be  saved 
by  considering  load  dependent  centers  as  the  first  and  last  centers  in  the  com¬ 
putation.  The  total  work  required  is  dependent  on  the  exact  number  of  load 
dependent  and  load  independent  centers,  and  can  be  derived  for  a  specific  net¬ 
work  in  a  straightforward  manner  from  the  results  presented  here. 

The  computation  of  the  Gm  for  load  dependent  centers  m  is  more  complex 
than  for  load  independent  centers  because  a  matrix  convolution  is  required.  For 
load  dependent  centers  we  have 

N  i  Hr 

Gm(ni . nn)  =  S  E  In)  *  nR-jR)  (3.3) 

i  1=0  3r= o 

where  Xm  is  given  by 


The  computation  of  the  Gm 
that  Gj  =  Xi. 


(  E  «™)!  R  TV 

•nR)  =  <m Xr'm{j) 

is  done  according  to  (3.3)  with  the  initial  condition 


The  work  required  to  compute  the  Gm  can  be  divided  into  the  initialization 
of  the  Xm  and  the  convolution  to  compute  the  Gm.  We  first  consider  the  convolu¬ 
tion  step. 

R 

Each  element  Gm{n  h . . .  ,nR)  requires  H(tv  +  1)  multiplications  and 

r=  1 
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(  f|(7lT  +  l))“i  additions.  Therefore,  the  total  work  to  compute  each  Gm  is 

r= 1 


*1 

E 

4= 


* R 

E 


nR=  0 


II("r  +  !) 


r=l 


ft  kNr  +  2)  (ATr  +  1)1 

r=l 1  J 

2* 


multiplications  and 


R 


iw  +  2)  (Nr  + 1) 

r  =  1 


2r 


r  =  l 


additions.  The  work  to  compute  all  the  Gm  is  therefore  ( M  -  1)  times  the  above 
quantities  plus  the  cost  of  computing  X\  (  =  G\). 

The  initializations  of  the  Xm  can  be  done  in  either  of  two  ways.  The  first  of 
these  is  simply  a  straightforward  computation  from  the  definition.  We  can  com¬ 
pute  each  element  of  Xm  using  the  recurrence  relation 

(l+  J]  tO 

1»  •  •  •I'K'r  •  ■  •  i71/?  )  “  {jl  i>  •  •  .i  TT'R  )  *  ^)*  (tLj.  +  l) 

and  the  initial  conditions  that  .^(0,...,  0)=1  for  all  m.  The  computation  of  each 

R  R 

therefore  requires  2  ]]J(Afr  +  l)  multiplications  and  additions  and  ]]J(:Vr+l) 

r=l  r=l 

divisions. 

The  second  method  of  computing  the  Xm  uses  some  additional  storage  to 

hold  the  multinomial  coefficients  that  are  common  to  all  the  Xm.  The  computa- 

R 

tion  of  these  coefficients  can  be  done  with  f]l(Afr+l)  multiplications  and  divi- 

r=l 

R 

sions,  and  2j[|(iVr+l)  additions.  Given  these  coefficients,  each  Xm  can  then  be 

r=  1 

R 

computed  in  2  T~|(.A/rr+l)  multiplications. 
r=i 

Once  the  Gm  have  been  computed,  the  performance  measures  may  be 
determined.  The  throughput  of  class  r  customers  at  center  m,  Trm,  is 
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computed  in  the  same  manner  as  for  load  independent  centers.  This  requires 
RM  multiplications  and  R  divisions  to  obtain  all  the  7V,m. 


Computing  the  utilizations  of  center  m  by  class  r  customers,  Ur<m ,  is  more 
difficult  in  the  load  dependent  case  than  in  the  load  independent  ease.  For 
these  centers,  Ur<m  is  given  by 


J v 


Urm  C*(Ar AT*) 


2  R  m  )  * 

n«=0  2  TLj 

3  =  1 


(3.4) 


Xm(N 

where  GM_m  is  Cj/-i  in  a  computation  where  center  m  is  the  last  to  be  convolved 
in  the  sequence  of  Gm.  Since  it  is  too  expensive  to  compute  M  different 
sequences  of  the  Gm,  Gu_m  must  be  derived  from  Gy  for  the  M- 1  centers  not 
convolved  last  in  the  computation  actually  followed.  This  can  be  done  [Bruell  78] 
by 


nl  nR 

— m  (jl  1»  •  •  ■  ‘  )  “  Gm(ti  \,...,TIr')  ~  S  ^m(j  Jr)  * 

3 1=0  1r~q 


-m  C71 1  “  .7  1*  •  •  ■  > n/?  “  Ji?  ) 


where  Gj£^m(ni,...,nR)  on  the  right  hand  side  is  assumed  to  be  0,  and  with  the  ini¬ 
tial  condition  that  Gjf-m(0,...,  0)  =  1.  This  requires  the  same  operation  count  as 
a  convolution,  plus  one  subtraction  for  each  element  of 

If  all  the  utilizations  are  to  be  computed,  we  can  save  computation  by  com¬ 
bining  effort.  We  compute  each  Cj/_m  only  once  for  each  center  m,  m  *  M  (since 

Gm-\  comes  for  free  out  of  the  convolution  algorithm).  Then,  once  is 

R 

known,  each  term  Gj^-m (71  *  Xm(N i-71u...,Nr-tir)  /  (  Y]  rij)  of  (3.4)  is 

3  =  1 

computed  only  once  for  all  classes.  In  this  way  the  total  work  required  is 
reduced.  Total  computation  costs  for  all  the  utilizations  are  listed  in  table  3.2. 

Mean  queue  lengths  can  be  computed  in  a  similar  manner.  The  mean  queue 
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length,  Qr  rn.  of  class  r  at  center  m  is  given  by 
1  N1 

Qr.m  -~n~Txr - T,  '  '  '  I!  ^  *  Xm(nh..  .,nR)  *  GM-m(N  1  -  nh...,NR  -  nR) 

Gm\N  R)  Ui=0  ni?=0 

This  is  identical  to  the  computation  of  the  Ur/m  except  that  no  divisions  are 
required  to  compute  each  Qr.m-  The  total  work  required  is  therefore  very  similar 
to  that  for  Urm.  The  results  are  tabulated  in  table  3.2.  (We  note  that  some  sav¬ 
ings  can  be  realized  when  computing  mean  queue  lengths  for  all  centers  by 
using  equation  3.2.) 

We  note  that  work  can  be  saved  by  computing  GM_m  once,  and  using  it  to 
compute  both  utilizations  and  mean  queue  lengths  at  center  i.  In  addition,  we 
can  compute  Xm(ni,...,nR)  *  Gu-m(N \-ni,...,NR-nR)  once  for  each  vector 
(n  1(..M nR)  and  use  it  to  compute  both  the  utilization  and  mean  queue  length. 
The  savings  possible  by  these  techniques  is  shown  in  table  3.2. 

If  we  have  more  than  one  customer  population  for  which  we  want  solutions, 
we  can  use  the  same  technique  as  for  load  independent  networks;  i.e.,  comput¬ 
ing  the  maximum  sized  Gm  only  once  and  using  submatrices  of  them  for  the 
other  populations.  An  example  is  given  in  table  3.2. 

CONVOLUTION:  SPACE 

For  the  analysis  of  space  we  shall  assume  that  the  network  is  load  indepen¬ 
dent  if  it  has  only  load  independent  service  centers,  and  load  dependent  other¬ 
wise.  This  is  reasonable  because  a  space  analysis  should  indicate  the  maximum 
storage  needed  at  any  point  in  the  computation.  Therefore,  the  presence  of  a 
single  load  dependent  center  is  equivalent  to  many  load  dependent  ones,  since 
the  maximum  storage  requirements  are  the  same  in  both  cases.  A  summary  of 
results  appears  in  table  3.3. 
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Table  3.2  -  Convolution  Load  Dependent  Time 


^ m 

(from  def.) 
Coefficient 
Matrix 
Xm 

(from  coeff. 
matrix) 


Total  w/o 
Util. 


Throughput 


util. 


Mean  Queue 
Length 


Cm-t 


Util.  & 

Mean  Queue 
Length 

TOTAL 


„  _  (Nr+2)(Nr  +  l) 

Q  -  |i  — g 

r—  1  c 


Q+  = 


(L+2)(L  + 1)  -A-  (Nj  +  2KNj 

2  J  L  2 


+  1) 


2 F  •,+ 

F  /  _ 

2 MP  •,+ 

ATP  / 

2 MP*  •,+ 

MP+  J 

k  10 
| 

^  V 

2F  + 

P  *./ 

2 F  + 

Coeff.  + 

2 F 

Coeff.  + 

2MP  * 

Coeff.  + 

2MP  * 

Xm  + 

(Q-F) 

Q-F  + 

A’l+ATg^'  ‘  *  ■ 

(Af-l)CQ-P) 

(Af-l)Q-F  + 

X  j  2+ . . .  +Afjf+ 

(Af-1)(Q+-F)  * 

(Af-1)Q  +  -F  + 

i 

+ 

< 

i _ 

Gu  + 

FAf  • 

i?  / 

+ 

FAf  (L  + 1)  * 

F(L+1)  / 

GM+Xm+ 

1+F  / 

2 P  * 

2F-1  + 

RM+MP  / 

(RM  +M)P  • 

ARM  +MJP  -RM  + 

(M-DGu-n+Xn* 

{LP1)(RM+MP)  / 

( L+1)(RM+M)P  * 

fL+l)ffFAf+Af)F-FAf)  + 

GU-m+Xm+ 

1  / 

2P  * 

1+P  + 

(M-l)Gu.m+MXm+ 

RM  / 

(. RM+M)P  * 

FAffl+F)  + 

(tf-l)<?*_m+*TO+ 

(L+1)FA/  / 

(L+l)(FAf+A/)F  • 

(L  +  1)FM(1+F)  + 

Q 

Q-P  + 

P 

^+(i!/-l)Zm  + 

(M-1XQ-P)  + 

(M  - 1  )F 

GJsr+(Af--l}ArTO+ 

(Af-1)Q  +  * 

(M-1)(Q+-F)  + 

(M-1)P  + 

Cjtf-TO+-*TO  + 

2+F  / 

2F  * 

3F-2  + 

(Af-l)CJf_m+J«TO  + 

2  RM+MP  / 

(RM+M)P  • 

RM&P- 2L  + 

(Af-l)%_m+Zm  + 
(L+l)(2FAf+AfP+)  / 

tL  +  l){RM+M)P+  • 

iL  +  1)RMJ3P+ -2A  + 

FAf+(FAf+3A/  +  l)F+2(Af-l)Q  • 
F+2FA/  +  (^  +  l)F  / 

(SRM-2M+4)P-RM+Z(M-1)Q  + 

JAf-JlP  _  _ 

(4RAf  +3Af  +5)F  +4(Af  -  1)Q 

+R  C2M  + 1 )  Ops 

1 - 

(4+2A/  +  2FAf  )F  +4(Af  -  1)Q 

+F(.¥  +  1)  Ops 

Load  Independent:  Space 


We  need  storage  for  only  one  Gm  at  a  time  in  load  independent  networks. 
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Table  3.3  -  Convolution  Space 


Load  Independent 

— "ft 

8llWr+l)  Gm  and  Gm+tti 

r-\ 

Load  Dependent 

it 

j_1  (Nr+ 1)  Coefficient  Matrix 

FK^r+D 
r =iJ 

zn(ATr+l)  for  and  GM,  GM-m 

r=l 

R 

4'H  (A/r+ 1)  Total 

_ r- 1  -- 

because  it  is  possible  to  place  newly  computed  elements  of  Gm+i  into  Gm  without 
interferring  with  subsequent  computations  of  the  elements  of  Cm+ 1.  No  other 
storage  is  required  to  compute  the  Gm  beyond  that  necessary  to  hold  the  net¬ 
work  description. 

Since  in  most  cases  the  only  use  of  performance  measures  is  as  output,  no 
storage  is  required  in  general  to  compute  the  performance  measures  above  the 
normal  work  variables  of  the  program.  If  the  performance  measures  were  to  be 
saved  (for  instance,  to  be  used  in  some  sort  of  hierarchical  modeling  process), 
we  would  need  one  location  for  each  measure  for  each  class/center  pair  (e.g., 
RM  locations  to  store  all  the  throughputs). 

An  exception  for  which  we  do  need  additional  storage  is  mean  queue 

lengths.  Their  computation  requires  storage  for  Gj/+m.  The  total  storage 

R 

requirement  for  load  independent  networks  is  therefore  2  |~[(Arr+l)  locations. 

r  =  1 


Load  Dependent:  Space 

If  we  compute  the  elements  of  Cra+1  from  greatest  index  to  lowest,  it  is  pos¬ 
sible  to  overwrite  the  array  for  Gm  with  the  new  elements  for  Gm+ 1.  The  storage 

for  the  computation  of  the  Gm  therefore  requires  one  /^-dimensional  array  of 

R 

(Nr+1)  locati  ons. 

r=l 
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In  addition  to  the  Gm,  we  must  store  the  Xm  for  load  dependent  networks. 

R 

Only  one  Xm  is  needed  at  a  time,  so  that  only  ]^]  (//,.+ 1)  locations  are  required. 

T-  1 

Since  we  must  know  the  Xm  to  compute  the  performance  measures,  we  can  save 

time  at  the  expense  of  space  by  storing  all  M  of  the  Xm.  This  requires 

R  R 

M  n(*r+U  locations  in  total.  Finally,  another  matrix  of  size  ][!(//,.  + 1)  must  be 

r=l  r- 1 

used  if  we  compute  the  Xm  using  the  method  of  stored  multinomial  coefficients. 

For  the  computation  of  the  performance  measures  we  may  have  to  compute 

G. M-m ■  This  requires  an  additional  /2-dimensional  array.  The  total  storage 

R 

requirement  is  therefore  4  j[J(ATr+l)  locations  if  we  do  not  save  all  the  Xm  and 

r= l 

R 

(M  +  3)  J”[(Afr+l)  locations  if  we  do.  Of  course,  a  compromise  can  be  made  by 

r= 1 

saving  only  some  of  the  Xm. 

3.2  MEAN  VALUE  ANALYSIS 

The  basic  MVA  algorithm  is  given  in  figure  3.1.  The  algorithm  begins  by 
computing  the  mean  queue  length  at  each  center,  the  system  throughput  by 
class,  and  the  mean  residence  time  for  each  class/center  pair  for  the  network 
with  populations  consisting  of  only  one  customer  (each  population  having  that 
customer  in  a  different  class).  The  results  of  this  network  are  used  to  compute 
the  same  measures  for  the  network  with  populations  having  one  more  customer 
in  one  of  the  classes.  This  recursive  procedure  is  continued  until  the  solution  of 
the  network  with  the  complete  population  vector  is  achieved. 

MEAN  VALUE  ANALYSIS:  TIME 

The  MVA  algorithm  is  greatly  simplified  if  the  service  center  being  con¬ 
sidered  is  either  load  independent  or  has  a  No  Queueing  service  discipline.  We 
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Figure  3.1  -  MVA  Computational  Algorithm 


Initialize:  Qm(6)  =  0 
Pm(0i5)=  1 


for  all  load  dependent  centers  m 
for  all  load  dependent  centers  m 


Loop  from  ft  =  (0  0  ...  0)  to  (N x  N 2  ...  Nr) 

For  r:=l  until  R  do  begin 

For  m:=  1  until  M  do  begin 


If  center  m  is  load  independent  then 
^r.m  :=^r,m(l 

else  if  center  m  has  a  NQ  service  discipline  then 

w  y 

"  r,  m  ■ 

else  (center  m  is  load  dependent) 

K.m  '■=  Xr,m(l  +  '^-±^Pm(i-l\it-gr)) 

end 

end; 


For  m:=  1  until  M  do 

rw 


Tr  = 


m 


For  m:=  1  until  M  do  begin 

If  center  m  is  load  independent  then 

£?„(*■)  :=  I,rTfy. 


else  if  center  m  is  load  dependent  then  begin 
For  i\- 1  until  i  ft  j  do 

pm(i  W  :=  -^-jJ]Tr(n)Xr,mPm(i-l\7i-Sr); 

Pm( 0  {  )  :  =  1-  1^); 

i=  1 

end 


end; 


end; 

therefore  divide  the  discussion  into  load  independent  (including  NQ)  and  load 
dependent  sections. 


Load  Independent:  Time 

In  the  case  of  load  independent  service  centers,  the  computation  of  the 
mean  residence  times,  Wr7n,  throughputs,  Tr,  and  mean  queue  lengths,  Qm,  is 
the  same  for  each  population  that  is  considered  in  the  algorithm.  We  therefore 
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restrict  the  discussion  to  the  work  associated  with  a  single  population,  and  mul- 
R 

tiply  by  FT {Nr+ 1),  the  number  of  populations,  to  get  the  total  work. 

r=: L 

For  each  population  (ni,...,7z./?)  we  have  to  compute  fKr.m,  Tr,  and  Qm.  For  a 
load  independent  center,  is  given  by 

^r.m  “  Xr,m  * 

The  computation  of  all  the  Wr<m  therefore  requires  RM  multiplications  for  each 
population. 

Each  Tr  requires  one  division  and  (M  -  l)  additions.  The  total  work  for  the 
Tr  is  therefore  R  divisions  and  R(M  -  l)  additions  for  each  population. 

Finally,  the  Qm  require  R  multiplications  and  R  additions  for  each  center  m. 
The  total  number  of  operations  required  for  each  population  is  therefore  RM 
multiplications  and  RM  additions. 

The  computation  of  the  performance  measures  from  the  quantities  com¬ 
puted  during  the  recursion  is  straightforward.  The  throughput  of  class  r  custo¬ 
mers  at  center  m  is  given  by  Trm  =  erm  *  Tr.  This  is  one  multiplication  per 
class/center  pair,  or  RM  multiplications  to  compute  ail  the  throughputs. 

Utilization  is  obtained  by  UriJn  =  XTtVl  *  Tr.  This  requires  a  total  of  RM  multi¬ 
plications  for  all  utilizations. 

Finally,  mean  queue  lengths  by  class  can  be  computed  in  either  of  two  ways. 
They  are  computed  naturally  as  part  of  the  process  of  computing  the  Qm,  and  so 
wc  can  save  these  values.  This  requires  no  additional  computation,  but  does 
require  considerable  extra  storage  if  they  are  saved  for  all  populations,  or  more 
complicated  procedures  if  they  are  saved  only  for  selected  populations.  The 
most  reasonable  approach  seems  to  be  to  compute  the  queue  lengths  directly 
by  Qr.m  =  er,m  *  T r  *  ^ r,m  =  ^r.m  *  ^r.m 


This  takes  one  multiplication  per 
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elass/center  pair  if  the  throughputs  are  known*  and  two  if  they  are  not. 
The  results  of  this  section  are  listed  in  table  3.4. 


Table  3.4  -  MVA  Load  Independent  Time 


Wr.m 

Tr 

Qrn 

Throughput 

Utilization 

Mean  Queue 
Length 

Total 


EACH  ALL  POPULATIONS  _  NT=0 . L 


RM 

* 

RMP 

* 

RMP+ 

* 

R 

/ 

RP 

/ 

RP+ 

/ 

R(M-l) 

+ 

R  (M-l)P 

+ 

R(M-1)P+ 

+ 

RM 

* 

RMP 

* 

RMP+ 

* 

RM 

+ 

RMP 

+ 

RMP+ 

+ 

RM 

* 

RM  {L  +  l)—M 

* 

RM 

* 

RM(L+l)-M 

RM 

* 

RM(L  +  l)-M 

* 

RM  (3+2P) 

♦ 

RM(3(L  +  l)+2P+)-3M 

* 

(2RM+R  )P 

+ 

(2 RM-R  )P+ 

+ 

RP 

/ 

RP+ 

/ 

/ 

RM(3+4P) 

Ops 

4 RMP + + 3  RM  (L  +  1)-3M 

Ops 

Load  Dependent:  Time 

The  computation  of  the  Tr  for  load  dependent  centers  is  identical  to  that 
for  load  independent  centers,  and  is  not  restated  here.  However,  the  computa¬ 
tion  of  the  Wr>m  is  different,  and  in  fact  allows  us  to  avoid  computing  the  Qm 
entirely. 

For  load  dependent  center  m,  is  given  by 

K 

~  ^r,m  Xj  “  1,71  j, .. .  ,71r  — 

fc  =  l 

/  \  ^  fyi  )  jFt  .  « 

where  Cm(fc)  =  k~ — t-t-  and  K  -  2_j  TW‘  Thus,  for  each  population  the 

‘-’to  (  1 )  r  =  l 

computation  of  the  FTr  ?n  requires  RMK  additions  and  RM  {K  +  l)  multiplications. 

R 

The  computation  of  the  Ck  is  a  fixed  overhead  of  M  XJ  Nr  divisions  and  multipli- 

r-\ 


cations. 


The  computation  of  the  Wrm  in  the  load  dependent  case  requires  the  quanti- 
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ties  Pm(k  ...ur),  in  place  of  the  Qm.  For  each  population  of  size  K, 

the  computation  of  the  Pn(k)  requires  M(R  -  1  ){K  +  1)  additions,  M(K  +  1) 
divisions,  and  2  RM (K  +  l)  multiplications. 

The  computation  of  the  performance  measures  is  identical  to  that  for  load 
independent  networks,  with  the  exception  that  there  is  no  straightforward 
means  of  computing  the  utilizations.  We  therefore  leave  these  out  of  our  con¬ 
siderations.  (Utilizations  can  be  computed  by  a  procedure  that  creates  a  matrix 
equivalent  to  G u  during  the  mean  value  analysis  solution  process  [Chandy  and 
Sauer  80].  However,  we  do  not  consider  that  here.)  The  time  requirements  of 
MVA  for  load  dependent  networks  are  shown  in  table  3.5. 


Table  3.5  -  MVA  Load  Dependent  Time 


EACH 

ALL  POPULATIONS 

i 

II 

o 

tr-< 

Cm 

MN 

*./ 

MN 

V 

MN 

V 

Cm  "t 

Cm  -t 

Cm  + 

RM(K+1) 

+,* 

RMP  (N +2) 

2 

+  .* 

RMP+  (N  +  2) 

2 

+.* 

R(M- 1) 

+ 

R(M-1)P 

+ 

R(M-1)P+ 

+ 

-^Y.m 

R 

/ 

/ 

RP 

/ 

RP  + 

/ 

M(R-l)(K+l) 

+ 

M  (R  -l)P-(N  +  2^- 

+ 

M(R-1)P+N+ 2 

+ 

Pm{k) 

M(K+ 1) 

t 

/ 

MpM±2 

/ 

Mp+±L±1 

/ 

2RM(K+l) 

* 

RMP(N  +  2) 

* 

RMP*  (N+2) 

* 

M 

- 

MP 

- 

MP + 

- 

Tput 

RM 

* 

RM  (L  + 1) 

* 

Qlen 

RM 

* 

RM(L  + 1) 

* 

N+2 

3  RMP^1^- 

* 

TOTAL 

MN+P(R+M  +  ^J^) 

/ 

p  (^~L(2RM-M)+R  (M- 1)) 

+ 

MP 

- 

3RMP  J  +  2  MN  +  2 RM 

Ops 

R 

K  = 

r  =  l 

N  =  X>V 

r=l 
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The  total  work  required  over  all  populations  is  a  very  difficult  quantity  to 

derive  analytically,  since  K  varies  with  each  population.  An  approximation  for 

the  total  work  in  multiclass  networks  is  to  assume  that  there  are  the  same 

number  Z  of  distinct  populations  for  each  possible  total  number  of  customers  in 

R  i? 

the  network.  Under  this  assumption,  Z  =  (  H  (Afr+ 1))  /  (1  +  f]Nr).  This 

r=l  r- 1 

approximation  overestimates  the  number  of  populations  with  very  few  or  very 
many  customers,  and  underestimates  the  number  of  populations  in  between. 
Since  the  number  of  populations  is  symmetric  around  the  maximum,  we  expect 
this  estimate  to  be  reasonably  accurate.  The  total  work  by  this  estimate  is  then 
P  times  the  above  operation  counts. 

Mean  Value  Analysis:  Space 

The  implementation  of  MVA  given  originally  by  Reiser  and  Lavenberg  [78] 
requires  that  the  Wrm,  Tr,  and  Qm  for  all  populations  be  stored  simultaneously. 
Myhre  [79]  has  modified  the  original  scheme  slightly  by  storing  all  the  popula¬ 
tions  that  have  a  constant  number  A  of  class  r+  customers,  where  r+  is  the  class 
with  the  largest  multiprogramming  level.  The  information  derived  from  the 
solution  of  the  network  with  these  populations  is  then  used  to  compute  the  per¬ 
formance  of  the  network  with  populations  having  A+ 1  class  r+  customers. 

A  more  storage  efficient  scheme,  but  one  that  incurs  significant  execution 

overhead,  has  been  used  in  the  implementation  of  the  BESTER  interactive 

queueing  network  solution  package  [Zahorjan  78].  This  technique  stores  infor- 

M 

mation  about  all  those  populations  (n . ,71^)  such  that  V  rim  is  a  constant. 

i- 1 

These  are  then  used  to  compute  the  statistics  for  populations  having  one  more 
customer  in  any  one  of  the  R  classes.  This  method  requires  the  minimum 
storage  possible  without  generating  statistics  about  any  population  more  than 
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oncc.  The  storage  requirement  for  this  scheme  is  certainly  less  than  those  of 

the  preceding  two  methods,  but  is  difficult  to  analyze  exactly.  It  is  possible  to 

M  R 

show  that  the  maximum  storage  is  required  for  K  =  nm  =  (  Yj  ^r)  /  2,  and 

i=l  r= 1 

that  the  storage  requirements  are  symmetric  around  this  value  [Alieunas  79]. 
However,  estimating  the  amount  of  storage  required  at  maximum  is  very 
difficult. 

Because  of  these  problems,  we  use  Myhre’s  method  as  an  upper  bound  on 
the  number  of  populations  that  must  be  stored  simultaneously.  Since  the  more 
storage  efficient  technique  involves  a  large  overhead  in  computing  the  popula¬ 
tions  for  each  total  number  of  customers,  Myhre’s  method  represents  a 
compromise  in  the  space-time  tradeoff  between  storing  all  the  populations  and 
storing  the  minimum  number. 

Load  Independent:  Space 

For  load  independent  networks  we  need  RM  locations  for  the  Wrm,  R  loca¬ 
tions  for  the  Tr,  and  M  locations  for  the  Qm  for  each  population  being  stored. 

Using  Myhre’s  method,  the  total  storage  requirement  is 

R 

(RM  +  R  +  M)  (Nr  +1)  locations. 

r=l,r  #r+ 

Load  Dependent:  Space 

For  load  dependent  networks  we  need  the  same  storage  as  for  load  indepen¬ 
dent  networks  (except  that  no  Qm  need  be  stored),  plus  we  need  storage  for  the 

R 

Pm(k)  and  Cm(k)  for  all  load  dependent  centers  m.  The  Cm(k)  require 

r= l 

locations  for  each  load  dependent  center.  Each  of  the  Pm(k)  requires  K  loca- 

R 

tions  for  population  (n . . . ,%) ,  where  K  -  tv,  for  a  total  of  M K  locations  for 

r= l 
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Table  3.6  -  MV  A  Space 

LOAD  INDEPENDENT  - 

Each  Population  All  Populations 
_ _ _  (upper  bound) 


wrM 

RM 

RMS 

Tr 

R 

RS 

Qm. 

M 

MS 

Total 

RM+R  +M 

S(RM+R+M) 

LOAD  DEPENDENT  - 

Each  Population  All  Populations 

_ _ _ _ _ _ _ (upper  bound) 


» 

RM 

RMS 

Tr 

R 

RS 

Pmfr) 

~7T~ 

r=l 

R 

SMj]Nr 

Cm 

M£Nt 

T-  1 

ij=x 

r- 1 

Total 

R 

RM+R+2MY,Nr 

_ _r=J _ _ 

Tl  R 

S  (RM+R  +Mj]  Nt)+m£ )  Nr 
_ r=J _ zj=J _ 

s  =  n  +d 

j—XtJ+r 


R 

all  the  Pm(k).  Obviously,  K  is  bounded  by  ][]  Nr.  A  bound  for  the  storage 

r—  1 

required  by  Myhre's  method  is  therefore 

J?  R  R 

Mj]Nr  +  (RM  +  R  +  Mj}Nr)  f[  (Nr  +  1) 

r=l  ral  r=l,r#r+ 


3.3  COMPARISON  OF  MEAN  VALUE  ANALYSIS  AND  CONVOLUTION  TECHNIQUE 

Tables  3.1  and  3.4  show  the  time  complexities  of  the  load  independent  con¬ 
volution  technique  and  MVA  respectively.  The  convolution  technique  requires  a 

R  R 

total  of  5RM-3R  +  (ARM  -2R )  {|(iVr  + 1)  operations  and  2  ]j[(Afr+l)  locations  of 

r=l  r= 1 

R 

storage.  Mean  value  analysis  requires  RM  (3  +  4  FT (Nr+ 1))  operations  and 

r  =T 

R 

(RM  +  R  +  M)  {*}  (Nr  +  1)  locations  for  the  same  networks.  Since  the  opera- 

r=l,rT*r+ 
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tion  counts  are  nearly  identical,  the  major  difference  between  the  two  tech¬ 
niques  is  storage.  Their  relative  efficiency  is  therefore  determined  by  the  ratio 
of  their  storage  requirements  for  the  particular  network  to  be  solved.  The  ratio 


of  the  convolution  storage  requirement  to  that  of  MVA  is 


2(Arr+  +  1) 
(RM  +  R  -f  M)  ' 


The 


relative  efficiency  of  the  two  algorithms  therefore  depends  on  the  particular 
network  to  be  solved. 

A  formal  comparison  of  the  relative  efficiencies  of  the  convolution  tech¬ 
nique  and  MVA  for  load  dependent  systems  is  difficult.  It  is  clear  that  MVA 
requires  a  great  deal  more  storage  than  the  convolution  technique  for  most  net¬ 
works  (i.e.,  for  all  networks  other  than  the  trivially  small  ones).  In  order  to  com¬ 
pare  the  two  methods  with  respect  to  time,  we  present  a  brief  analysis  of  the 
single  class  cases,  a  situation  for  which  exact  analytic  solutions  for  the  time 
requirements  are  possible.  Our  analysis  covers  the  work  required  from  network 
description  to  output  of  the  throughputs  and  mean  queue  lengths. 

Using  the  convolution  algorithm  for  a  single  class  network,  each  Xm  vector 
requires  N  multiplications  (where  N  is  the  number  of  customers  in  the  model), 

or  MM  for  all  the  Xm.  Each  Gm  requires  (i  +  i)  =  +  multiplications 

4= o  d 

and  )■  -//  additions.  The  total  work  for  the  Gm  is  therefore  (M-l) 

C 


times  the  above  quantities  (remembering  that  Gi  =  X^).  The  throughputs  of  the 
service  centers  can  be  computed  with  one  division  and  ¥  multiplications.  The 
mean  queue  lengths  require  the  computation  of  (M-l)£,y_m  plus  M  divisions, 
2M  (N  +  l)  multiplications,  and  MN  additions.  Finally,  the  take 

{M-l)  multiplications,  {M-1)N  subtractions,  and 

( M - 1 ) additions.  This  totals  to  2N2{M  —  l)  +  10MN  -  6M  +  BM  —  3 


operations. 
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N 

MVA  requires  Af£]i  =  M (N*1  +  N)  /  2  additions,  and  M(N2  +  N)  multiplica- 
i- 1 

tions  for  the  Wrm,  plus  MN  divisions  and  multiplications  for  the  0^(1:).  Addition¬ 
ally,  N  divisions  and  (M  -  l)N  additions  are  needed  for  the  Tr.  The  computation 
of  the  Pm{k)  requires  M(N2  +  N)  /  2  divisions,  and  M(N2  +  N)  multiplications. 
Finally,  the  throughputs  and  mean  queue  lengths  require  M  multiplications  each. 

5 

This  is  a  total  of  —MN2  +  6MN  +  2 M  operations. 

If  we  take  the  difference  between  the  two  operation  counts,  we  see  that  the 
convolution  technique  is  more  efficient  in  the  single  class  case: 


MVA  -  CONVOLUTION  =  MN2  +  2N 2  -5 MN  +  QN  -  6M  +  3  operations 


However,  as  the  size  of  the  network  increases,  it  may  be  the  case  that  MVA  is 
more  efficient,  since  its  complexity  is  basically  0  ((]Q(Arr+  l))(Y]Nr))  compared 

r  r 


/_(JVr+ 1)® 

with  convolution’s  0(i  } - — - ). 


Table  3.7  compares  the  two  solution  methods  for  a  number  of  multiclass 
networks.  Both  time  and  space  requirements  are  given.  Networks  with  various 
numbers  of  classes,  centers,  and  total  customers  are  considered. 


3.4  CONCLUSION 

Although  the  convolution  technique  is  faster  and  requires  less  space  in  gen¬ 
eral  than  MVA,  MVA  does  have  at  least  two  advantages.  The  primary  advantage  is 
that  it  is  numerically  stable  (with  the  possible  exception,  as  noted  previously,  of 
some  load  dependent  networks  (see  [Chandy  and  Sauer  80]  and  [Reiser  80])). 
The  convolution  technique  raises  the  relative  utilizations  of  the  servers  to  the 
power  of  the  number  of  customers  in  the  network.  This  can  cause  numerical 
significance  problems.  Mean  value  analysis  avoids  this  problem  by  deriving  the 
solution  in  terms  of  high  level  performance  measures,  thereby  avoiding  the 
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Table  3.7  -  Comparison  of  MV  A  and  Convolution 


LOAD  INDEPENDENT  - 


TIME  SPACE 


M 

R 

Conv. 

MVA 

Conv. 

MVA 

5 

2 

3 

636 

670 

32 

68 

5 

5 

3 

96,366 

102,475 

2048 

8960 

15 

5 

3 

301,496 

307,425 

2048 

24,320 

5 

2 

10 

4,521 

4,870 

242 

137 

15 

2 

10 

14,301 

14,610 

242 

517 

15 

5 

10 

4.7349  E7 

4.8315  E7 

322,102 

1,390,095 

LOAD  DEPENDENT  - 


TIME  SPACE 


M 

R 

Nr 

Conv. 

MVA 

Conv. 

MVA 

5 

2 

3 

2,134 

4,215 

64 

198 

5 

5 

3 

1,644,537 

4,521,747 

4,096 

26,955 

15 

5 

3 

5,787,467 

13,303,097 

4,096 

78,305 

5 

2 

10 

73,699 

83,418 

484 

1,332 

15 

2 

10 

255,219 

247,465 

484 

3,952 

15 

5 

10 

7.016  E10 

7.888  E  10 

644,204 

12,152,780 

computation  of  the  relative  probabilities  of  the  states  and  consequent  numerical 
problems. 

The  other  important  advantage  of  MVA  is  that  it  is  more  easily  understood 
than  the  convolution  algorithm.  The  intuitive  basis  of  MVA  facilitates  the  formu¬ 
lation  of  approximation  techniques  for  non-product  form  networks  ([Bard 
79,80]).  The  mean  value  technique  is  exact  for  separable  networks  because  the 
computation  of  the  fFrm  can  be  done  exactly  for  these  networks.  However,  the 
algorithm  is  valid  for  any  network  for  which  the  Wrm  can  be  computed  based 
upon  the  solutions  of  the  networks  with  smaller  customer  populations.  There¬ 
fore,  if  good  approximations  for  the  Wr/m  are  developed  for  a  non-separabie  net¬ 
work,  MVA  can  be  expected  to  give  good,  approximate  results.  This  approach 
has  been  taken  by  Bard  [79,80]  with  encouraging  success. 


On  the  other  hand,  the  convolution  technique  has  advantages  over  mean 
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value  analysis  other  than  efficiency.  One  of  these  is  the  ability  to  solve  networks 
with  service  centers  whose  service  functions  are  load  dependent  on  information 
other  than  the  total  queue  length  at  that  center  (the  only  form  of  load  depen¬ 
dency  currently  manageable  with  MVA).  This  can  be  an  important  consideration 
if  one  wishes  to  replace  a  group  of  service  centers  with  an  equivalent,  composite 
service  center.  The  composite  service  center  will  be  load  dependent,  but  not  on 
the  toted  queue  length  (in  general).  It  is  therefore  much  more  efficient  to  do 
parametric  studies  where  a  subset  of  the  service  centers  remain  unchanged 
using  the  convolution  technique  rather  than  MVA. 

Another  advantage  of  the  convolution  solution  method  is  the  ability  to 
examine  interesting  aggregates  of  the  steady  state  probabilities  other  than 
throughput  and  mean  queue  length.  For  example,  it  is  a  relatively  simple 
matter  to  derive  utilizations  for  load  dependent  centers  using  the  convolution 
technique,  while  it  is  much  more  difficult  using  MVA.  Another  example  is  queue 
length  distribution,  an  easy  quantity  to  derive  using  convolution,  but  one  that  is 
more  difficult  to  obtain  using  MVA. 

Finally,  the  convolution  technique  is  well  understood,  and  can  be  used  to 
compute  the  solution  of  any  known  separable  queueing  network  [Zahorjan  79]. 
At  present,  MVA  is  applicable  only  to  a  subset  of  the  known  closed,  separable 


networks. 


CHAPTER  4  -  AGGREGATION  OF  CLASSES 


In  this  chapter  we  investigate  the  aggregation  of  classes  as  a  method  for 
reducing  the  cost  of  solution  of  separable  networks.  The  aggregation  of  even  a 
few  classes  should  yield  substantial  improvements  in  execution  and  storage 
requirements  since  both  increase  exponentially  with  the  number  of  classes.  For 
example,  the  solution  of  a  network  with  10  centers  and  10  classes  of  10  custo¬ 
mers  each  requires  on  the  order  of  10 15  operations  and  1012  locations.  A  single 
class  network  with  10  centers  and  100  customers  requires  only  4000  operations 
and  200  locations  of  memory. 

In  the  first  two  sections  of  this  chapter  we  present  two  kinds  of  class  aggre¬ 
gation  that  are  known  to  be  exact,  aggregation  of  classes  in  open  chains  and 
aggregation  of  classes  in  the  same  chain.  These  results  are  then  used  as  a  basis 
for  developing  approximate  class  aggregation  techniques  in  the  rest  of  the 
chapter. 

4.1  AGGREGATION  OF  OPEN  CHAINS 

Consider  a  queueing  network  with  some  open  and  some  closed  routing 
chains  (i.e.,  a  mixed  network).  Since  some  chains  are  open,  the  number  of  cus¬ 
tomers  in  the  system  at  a  particular  moment  can  be  any  finite  number.  The 
state  space  is  correspondingly  infinite,  and  so  the  standard  solution  techniques 
cannot  be  applied. 

Reiser  and  Kobayashi  [75]  and  Krzesinski  and  Teunissen  [77]  have  shown 
that  for  networks  that  are  load  independent  with  respect  to  the  open  chains  it  is 
easy  to  compute  the  marginal  probability  of  a  state  of  the  customers  in  the 
closed  chains  (i.e.,  the  sum  of  the  probabilities  of  observing  a  particular  state  of 
the  closed  chain  customers  over  all  possible  states  of  the  open  chain  custo- 
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mers).  In  fact,  the  distribution  of  these  marginal  states  can  be  computed  in  the 
same  fashion  as  solutions  for  closed  networks. 

Consider  a  product  form  network  QN  with  R  customer  classes.  Let  classes 
1,2 be  members  of  closed  routing  chains,  and  classes  C  +  l,. ...,/?  be 
members  of  open  chains.  We  assume  that  all  centers  are  load  independent  for 
the  classes  in  the  open  chains.  All  non-FCFS  centers  may  be  load  dependent 
with  respect  to  class  r  as  a  function  of  the  number  of  class  r  customers  at  the 
center.  Let  Xrm  be  the  relative  utilization  of  center  m  by  class  r, 
r  =  l,...,C,  m  =  l,...,Af.  We  can  solve  for  the  marginal  probabilities  of  this  mixed 
network  by  solving  the  equivalent  C  class  closed  queueing  network  with  rela¬ 
tive  utilization  vectors  £  defined  by 

Xr,m/0-~U o,m)  if  center  m  is  FCFS,  PS,  or  LCFSPR 
Xrim,  if  center  m  is  NQ 

and  using  the  same  load  dependent  factors  as  in  the  mixed  network.  Here  Uom 
is  the  total  utilization  of  center  m  in  QN  by  the  classes  in  the  open  chains  (i.e., 
classes  C  +  l,.. .,/?).  The  U0>m  can  be  computed  easily  by  solving  the  open  network 
consisting  of  classes  C  +  l  through/?  only. 

The  throughput  and  queue  length  distributions  obtained  from  this 
equivalent  network  are  identical  to  the  corresponding  marginal  measures  of 
the  original  network  QN.  We  can  therefore  reduce  the  problem  of  solving  for  the 
performance  measures  of  the  closed  chains  of  a  mixed  network  to  one  of  solving 
a  closed  network. 

4.2  AGGREGATION  OF  CLASSES  IN  THE  SAME  ROUTING  CHAIN 

Another  aggregation  that  can  be  performed  exactly  is  that  for  classes  in  the 
same  routing  chain  ([Krzesinski  and  Teunissen  77],  [Balbo  et  al.  77],  [Bruell  73]). 
This  result  is  important  both  because  it  allows  us  to  restrict  our  consideration  of 
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queueing  networks  to  those  without  class  changes  and  because  this  exact  aggre¬ 
gation  serves  as  a  guide  for  the  approximate  aggregation  of  classes  in  different 
routing  chains. 

Consider  a  load  dependent  network  QN  with  R  classes,  C  (closed)  chains 
(where  C<R),  relative  utilization  vectors  Xr<m,  and  relative  throughputs  er>m.  We 
allow  load  dependent  rates  for  each  class  that  are  functions  of  the  total  number 
of  customers  at  the  center*.  The  solution  of  this  network  can  be  obtained  by- 
solving  an  equivalent  network  Q$  with  C  classes,  each  a  separate  routing  chain. 
The  relative  utilization  vector  of  class  c  in  the  equivalent  network, 
£cm>  771  =  1,. ..,M,  is  given  by 

^ c,m  ~  Xrm 

r  t  chain  c 

(The  result  of  the  summation  is  well  defined  because  the  relative  utilizations  of 
the  classes  in  a  chain  must  be  in  the  fixed  ratios  to  each  other  defined  by  the 
input  data  specifying  the  network.)  The  multiprogramming  level  of  class  c,  #c,  is 
correspondingly  computed  as 

N  =  V  y 

Jvc  L-i  *vr 

r  c  chain  c 

The  load  dependent  rate  factors  are  unchanged. 

Performance  measures  for  the  R  original  classes  of  QN  are  easily  derived 
from  those  obtained  from  the  solution  of  QN.  In  particular,  the  throughput  of 
class  r  at  center  m,  Tr  m,  is  given  by 

rp  _  er.  TO  rp 

1  r.m  ~  ST1  c.m 

j  c  chain  c 

where  c  is  the  aggregate  class  corresponding  to  the  chain  to  which  class  r 
belongs,  and  er7n  is  the  relative  throughput  of  class  r  at  center  m  in  QN.  For 

load  independent  and  NQ  centers,  the  utilization  Urm  is  given  by 

[Balbo  et  al.  77]  discusses  more  general  formulations  of  this  result.  We  present  here  only 
that  material  that  is  useful  as  background  for  subsequent  work. 
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Ur,m  “  c,ra> 

and  the  mean  queue  length,  Qr,m.  by 

Qr,m  =  {-^r,m/^c,m)*Qc,m 

Similar,  though  more  complicated,  expressions  hold  for  load  dependent  centers 
[Bruell  78].  For  these  centers,  class  dependent  utilizations  and  queue  lengths 
are  computed  from  information  given  by  the  chain  dependent  queue  length  dis¬ 
tributions. 

A  simple  explanation  for  the  aggregationprocess  can  be  given  by  consider¬ 
ing  the  product  form  of  the  state  probabilities.  Consider  a  network  with  two  cus¬ 
tomers,  each  of  which  is  in  a  separate  routing  chain.  Each  chain  consists  of  two 
classes.  Let  classes  1  and  2  be  in  chain  A,  and  3  and  4  in  chain  B.  The  equivalent 
aggregate  network  has  only  two  classes,  with  relative  utilization  vectors 


^ A, m  —  )  7TL  —  1, , M 

^ nr  1 

Consider  the  state  of  the  aggregate  network  with  both  customers  at  some 
load  independent  center,  say  center  1.  The  relative  probability  of  the  state  is 


2*i .  iXB.  i  =  Z(Jf  1. 1+JTs.  i)(*  a.  i+*4.  i) 

“  i  X 3t  i  +  ZX\'  i  X  4i  i  +  2XS,  i  X3t  i  +  2X2,  i  X  4t ! 

The  relative  probability  of  the  state  of  the  aggregate  network  is  the  sum  of  the 

relative  probabilities  of  all  states  in  the  original  network  with  both  customers  in 

either  of  their  possible  classes  at  center  1.  It  is  therefore  the  exact  aggregate 

of  the  corresponding  states  of  the  original  network. 

In  the  more  general  case,  the  aggregation  is  exact  because  of  the  product 
form  solution  of  the  network.  When  a  factor  $c.m  of  is  raised  to  the  power 
ncm  in  a  relative  state  probability,  its  expansion  in  terms  of  the  relative  utiliza¬ 
tions  of  the  original  classes  of  QN  is  the  sum  of  all  possible  allocations  of  the 
7icrn  customers  to  the  classes  in  chain  c.  Therefore  the  aggregation  is  exact. 
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There  are  two  important  properties  of  classes  in  a  single  routing  chain  that 
allow  for  exact  aggregation.  The  first  is  that  the  customers  in  the  chain  can  be 
distributed  in  any  manner  among  the  classes  in  that  chain.  This  is  necessary 
because  the  factor  expands  to  all  possible  distributions. 

The  second  property  is  that  the  ratios  of  the  relative  utilization  vectors  for 
the  classes  in  each  chain  are  fixed  by  the  definition  of  the  network.  The  propor¬ 
tionality  of  the  relative  utilization  vectors  is  determined  by  the  rate  at  which 
customers  change  class.  Since  this  is  an  input  parameter,  appropriate  ratios 
must  be  supplied.  This  characteristic  is  needed  so  that  m  is  not  dominated 
arbitrarily  by  one  of  the  Xrm  of  which  it  is  composed.  If  one  set  of  Xrm  is  scaled 
by  a  factor  s,  all  XjiTn  composing  must  be  similarly  scaled.  This  results  in  a 
new  J?cm  that  is  s  times  the  original.  Since  is  a  relative  utilization  vector, 

this  multiplication  by  a  constant  does  not  affect  the  solution. 

In  the  next  section  we  investigate  an  approximation  involving  the  aggrega¬ 
tion  of  classes  in  different  routing  chains.  Although  the  properties  described 
above  do  not  hold  in  these  cases,  we  show  that  an  approximation  procedure 
analogous  to  that  given  in  this  section  for  classes  in  the  same  chain  gives  rea¬ 
sonably  accurate  results. 

4.3  AGGREGATION  OF  CLASSES  IN  DISTINCT  CHAINS 

i 

Consider  the  set  of  closed,  separable  queueing  networks.  Suppose  we  want 
to  reduce  the  complexity  of  the  net-work  by  aggregating  classes.  The  straight¬ 
forward  approach  is  to  follow  the  technique  given  in  section  4.3  for  the  aggrega¬ 
tion  of  classes  in  the  same  routing  chain. 

We  assume  a  queueing  network  QN  with  R  classes,  each  in  a  distinct  routing 
chain,  M  service  centers,  and  Nr  customers  in  class  r.  Each  service  center  may 
be  load  dependent  as  a  function  of  the  total  customer  residency  at  the  center. 
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We  wish  to  speed  up  the  solution  process  by  transforming  QN  into  an  approxi¬ 
mately  equivalent  network  with  A  customer  classes,  1^4  </?,  solving  Q$,  and 
approximating  the  solution  of  QN  by  simple  functions  of  the  solution  of  QN. 


Let  i4,  be  a  mapping  from  classes  of  onto  a  partition¬ 

ing  of  the  classes  of  QN,  and  let  F*  :(l, )-»(l,...,i4  )  be  the  inverse  relationship. 
F(a)  is  therefore  the  set  of  classes  in  QN  that  are  represented  by  aggregate 
class  a  in  Q$,  and  F'  (r)  is  the  aggregate  class  of  Q$  to  which  class  r  of  QN 
belongs.  Define 

=  S  a  =  l  ,...,A 

r  e  F( a) 

is  the  multiprogramming  level  of  aggregate  class  a  in  Qft. 

We  let  Q$  have  the  same  number  of  service  centers  as  QN,  and  we  let  each 
service  center  have  the  same  load  dependent  function  as  in  QN  (remembering 
that  the  load  dependent  rates  can  depend  only  on  the  total  number  of  custo¬ 
mers  at  the  center).  To  complete  the  specification  of  QN  we  need  only  define 
the  relative  utilization  vectors  of  the  A  classes. 

The  customers  of  aggregate  class  a  represent  the  sum  of  the  service 
requirements  of  the  classes  F( a)  of  QN.  The  utilization  of  each  center  of  QN  by 
class  a  should  therefore  equal  the  sum  of  the  utilizations  of  the  equivalent 
centers  of  QN  by  classes  F(a).  This  means  that  aggregate  class  a  has  a  relative 
utilization  vector  •whose  elements  are  in  the  same  proportion  as  the  sum  of  the 
utilizations  at  each  center  of  classes  F(a)  in  QN.  A  natural  choice  for  the  rela¬ 
tive  utilization  of  class  a  at  center  m  in  Q$,  %a.m>  therefore  is 


X 


a,m 


n 

r  e  F( a) 


U. 


r.m 


a  =  1  ,...,A 
m  - 


where  Urm  is  the  utilization  of  class  r  at  center  m  in  QN.  (The  problem  of 
estimating  Xarn  without  solving  QN  to  obtain  the  Urm  will  be  treated  in  section 
4.5.)  Notice  that  this  choice  of  the  ensures  that  the  utilizations  of  the 
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centers  in  Q$  are  in  the  same  proportion  to  each  other  as  the  total  utilization  of 
those  centers  by  the  R  original  classes  (although  these  quantities  may  not  be 
equal).  Notice  also  that  since  the  are  relative  utilizations,  we  actually  need 
require  only  that  the  vectors  (UTi  1(  Ur>  Urjj)  be  in  the  correct  proportion  to 
each  other,  and  not  that  they  be  the  absolute  utilizations. 

This  procedure  for  aggregating  classes  is  really  an  extension  of  the  process 
undertaken  by  the  system  analyst  in  deriving  parameter  settings  for  a  single 
class  model  from  measurement  data.  In  those  cases  one  typically  apportions 
the  total  utilization  of  the  devices  equally  among  the  jobs  that  complete  during 
the  measurement  interval.  This  is  equivalent  to  summing  the  utilizations  of  the 
devices  for  each  potential  customer  class  of  a  multiclass  model  and  creating  a 
single  aggregate  class,  which  is  exactly  our  class  aggregation  technique.  That 
this  aggregation  is  the  natural  technique  was  pointed  out  in  [BCMP  75],  although 
their  purpose  in  presenting  that  specific  many-to-one  aggregation  was  quite 
different  than  ours  here.  Sauer  and  Chandy  [75]  also  mention  application  of  a 
specific  kind  of  class  aggregation,  many-to-three,  in  a  central  server  context, 
but  again,  their  purpose  was  quite  different  than  ours. 

Figure  4.1  -  Class  Aggregation  Technique 

QM  _ Qfi 

M  #  Centers 

R  #  Classes 

Nr  MPT, 

Y  Relative 

r’m  Utilizations 

drain)  Load  Dependent 
Rates 

Figure  4.1  summarizes  the  construction  of  Qft.  Once  the  aggregate  model 
Q$  has  been  formed  using  the  transformation  described  above,  it  can  be  solved, 


M 

R 


Va  =  S 

r  e  f  (a) 

■^a  ,m  ~  XI  U r,m 

r  t  F{ a) 

drain) 
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and  class  dependent  throughputs,  ?V<m,  utilizations,  0T<m,  and  mean  queue 
lengths,  @ri7n,  obtained.  The  performance  measures  of  the  original  classes  of  QN 
can  be  approximated  from  those  obtained  from  the  solution  of  in  the  same 
manner  as  described  in  section  4.3;  that  is,  the  mean  performance  measures  at 
center  m  for  aggregate  class  a  are  divided  among  the  classes  in  .F(a)  in  propor¬ 
tion  to  each  class’  share  of  the  utilization  at  center  m.  Specifically,  we  use 

Pt  =  .XriHL  n 

u  r.m  u  oLrn 

■"•0,771 

fur,m  rfs 

r.m  ~  Vo  1  a,m 
2-i  e;'.7n 

3  e  F(a) 

0  =  a 

**  r.m  h  a.m 

A  a,m 

where  a  =  F' (r)  and  Ur>m,  Tr>m,  and  Qr>m  denote  the  approximations  to  the  true 
values  Urm,  Trm  and  Qrm.  The  approximation  for  Trm  is  exact  if  the  perfor- 
mancc  measures  of  QN  are  identically  the  aggregate  of  the  measures  of  QN. 

Unfortunately,  the  solution  of  will  not  generally  correspond  exactly  to 
that  of  QN.  They  differ  because  the  state  space  of  Qft  is  not  an  exact  represen¬ 
tation  of  that  of  QN.  For  example,  given  a  two  class  network  with  three  custo¬ 
mers  in  each  class,  consider  any  state  of  the  corresponding  aggregate  network 
with  four  customers  at  the  first  service  center  (which  is  assumed  to  be  load 
independent).  The  factor  in  the  state  probability  corresponding  to  center  1  hav¬ 
ing  4  customers,  F1(4),  is  given  by 

F,(4)  =  5-f 

=  (Uui+U^i)* 

=  Uti  +  UJhUt.1  +  auUuli  +  4171,(71,  +  u£i 

The  first  and  last  terms  correspond  to  states  with  four  class  1  and  four  class  2 
customers  are  at  center  1  respectively.  As  only  three  of  each  class  exist,  these 
states  of  Qft  represent  states  that  are  impossible  in  QN. 

This  example  points  out  that  the  aggregate  network  introduces  extraneous 


-  56  - 


states  whenever  the  number  of  class  a  customers  at  any  center  is  larger  than 
the  multiprogramming  level  of  any  of  the  classes  in  F(a)*.  Our  hypothesis  is 
that  these  extra  states  will  be  generated  in  such  a  way  that  gross  performance 
measures  such  as  utilization  and  mean  queue  length  will  not  be  significantly  in 
error. 


Table  4.1  -  Example  Model 

Na  =  2  MB  =  2 

XA  =  (  3  2)  **  =  (12) 

UA  =  (.495  .330)  UB  =  (.269  .537) 

Qa  =  (1.05  .951)  Qb  =  (.635  1.32) 

As  a  simple  example  of  the  aggregation  procedure,  consider  a  model  of  a 
computer  system  with  two  classes  of  jobs.  Class  A  is  CPU  bound  (higher  relative 
utilization  at  center  l),  while  class  B  is  10  bound  (higher  relative  utilization  at 
center  2).  The  model  parameters  and  solution  are  given  in  table  4.1.  The 
corresponding  aggregate  model  (formed  using  the  utilizations  from  the  exact 
solution)  and  its  solution  are  given  in  table  4.2.  The  predicted  throughputs  are 
within  3%  and  the  queue  lengths  within  11%  of  the  exact  solution. 

Table  4.2  -  Aggregate  Model 
$  =  4 

2  =  (.7635  .8670) 

0  =  (.7462 .8474) 

Q  =  (1.748  2.253) 

UA  -  (.484 .323)  0B  -  (.262  .525) 

Qa  =  (1.133.858)  =  (-615  1.395) 

Intuitively,  we  expect  the  approximation  procedure  to  be  good  when  the 
sum  of  the  probabilities  of  these  extraneous  states  is  small.  We  should  also 
expect  the  aggregation  technique  to  be  exact  when  the  classes  being  aggregated 

are  really  the  same  class,  i.e.,  when  the  vectors  Ur  m  are  all  Nr  times  some  single 

*  The  aggregate  state  space  represents  all  states  of  QN,  as  well  as  extraneous  states  not  ap¬ 

pearing  in  QN. 
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vector*.  In  the  next  section  we  investigate  the  error  behavior  more  precisely, 
and  introduce  analytic  error  bounds  for  the  approximation. 

4.4  ANALYTIC  ERROR  BOUNDS 

In  this  section  we  investigate  analytic  error  bounds  for  the  class  aggrega¬ 
tion  procedure.  Our  measure  of  error  is  the  ratio  of  the  utilizations  predicted 
by  the  class  aggregation  technique  to  those  of  the  exact  solution.  The  bounds 
developed  pertain  to  the  idealized  class  aggregation  technique  of  section  4.4; 
that,  is,  they  hold  in  those  situations  where  the  absolute  utilizations  of  the  model 
are  already  known.  The  bounds  can  also  be  viewed  as  limits  on  the  error  caused 
by  creating  a  single  class  model  from  measurement  data  about  a  system  with 
multiple  customer  classes. 

Two  error  bounds  are  developed.  The  first  has  the  desirable  property  that 
it  bounds  the  error  to  0%  when  all  classes  being  aggregated  are  statistically 
identical.  The  second  set  of  bounds  do  not  have  this  property,  but  are  tighter 
than  the  first  in  general. 

Consider  a  network  QN  with  R  classes,  Nr  customers  in  class  r,  and  M  ser¬ 
vice  centers.  Let  Urm  be  the  utilization  of  class  r  at  center  ?n.  Finally,  let 
am( n )  be  the  load  dependent  speed  factor  of  center  m  as  a  function  of  its  total 
queue  length. 

We  wish  to  aggregate  the  R  classes  of  QN  into  the  A  aggregate  classes  of  Q$. 
For  the  purpose  of  deriving  an  error  bound,  this  is  done  in  the  following  manner. 
Let  F(a)  be  a  set  of  classes  of  QN  that  are  to  be  aggregated  in  Qfv.  A  new  model, 
QN',  is  formed  that  is  functionally  identical  to  QN  but  with  aggregate  class  a  of 

Qft  represented  in  QN'  by  |A'(a)|  identical  classes  a’j,  j- 1,...,  ]F1(a)|.  The 

*If  all  classes  of  QN  comprising  aggregate  class  a  arc  statistically  identical,  the  extraneous 
states  formed  are  equal  to  terms  which  are  valid.  Hie  net  effect  of  these  states  then  is  to 
scale  the  normalizing  constant  by  some  factor  C,  which  does  not  alter  the  solution. 
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multiprogramming  level  of  class  a',-,  $a',  is  set  equal  to  the  multiprogramming 

J  j 

level  of  the  jth  class  of  F(a)  (for  an  arbitrary  ordering  of  those  classes)  in  QN. 

We  assume  that  the  ratios  of  the  utilizations  by  each  class  is  known,  that  is, 
the  relative  utilizations  of  the  classes  in  QN  are  given  by 


y  —  r+iT 
r.m  —  c 

where  c  is  an  arbitrary  (non-negative)  constant.  Define  =  XI  Xrm.  This 

r  t  F( a) 

is  the  relative  utilization  vector  for  aggregate  model  Q$.  The  relative  utilization 
of  aggregate  class  a’;-,  X'CL-.rrL,  is  defined  as 


N 


X ' 


ai 


ai‘m  “  !F(a)| 


^  a,  m  FTl  —  1 , . . . ,  M 


i  =  l 


The  result  of  this  procedure  is  to  create  an  R  class  network  QN,  where  the  R 
classes  are  partitioned  into  A  groups,  each  of  which  represents  one  aggregate 
class  of  Qft. 

The  utilization  of  center  m  by  class  a  in  Q$  expressed  in  terms  of  the  solu¬ 
tion  of  QN'  is  given  by 


On  m  =  X „<  m 

0,771  a  yVX 


\F(a)\ 


N 


k  =  1 


ak 


G'  (0)  Na’, 


Here  we  are  simply  inflating  the  utilization  of  one  of  the  |  jF*  (cz  )  |  identical  classes 
of  QN'  to  account  for  the  customers  in  the  other  classes. 

The  utilization  of  class  r  at  center  m  in  QN  is  approximated  by 


U 


Lr,m 


r,m 


S  X. 
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o,m 


3  eF(a) 


j  ,m 


*r. 
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3  e  F {a) 


Xn' 


a  j,m 


(?(#) 


\F(a)\ 


k  =  l 


-V.', 
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*,.»  0  C'0-4a-t) 

e  xim  c-m 

j  cF{a) 


C’(tf -?«■,) 

_jrr-,n  G-m 

where  a  -  F'  (r),  and  a  'j  is  the  jth  aggregate  class  of  QN '  corresponding  to 
aggregate  class  a  of  Q$. 

Consider  the  ratio  Er,m - Or,m/Ur, m •  This  is  the  ratio  of  the  predicted  utili¬ 
zation  to  the  actual  utilization.  We  can  express  the  actual  utilization  Urm  as 


Ur.m  ~  Xr  m 


Therefore, 


C{H) 


Xr.m  G'  (Af  8a'j )  Q  (/?  j 

=  C'W)  XnmC0-gr) 
-  —A  c(jl} 

G(N-er)  G'  (/v  ) 


Each  of  the  terms  of  G  and  G‘  above  is  a  summation  over  all  possible  states  given 
the  appropriate  customer  population. 


Each  of  the  quotients  in  (4.1)  has  the  same  number  of  terms  in  the  numera¬ 
tor  and  denominator.  Further,  each  term  is  composed  of  M  multinomial 
coefficients  multiplied  by  powers  of  terms  of  the  relative  utilization  vectors.  We 
can  take  advantage  of  these  facts  when  obtaining  an  error  bound.  We  do  this  by 
bounding  the  term-by-term  ratios  of  each  quotient  independently,  thus  bound¬ 
ing  the  entire  quotient. 

Using  this  approach  we  have 


Er%m  £  maximum  of  the  term-by-term  ratios 


=  MAX 

hi . iRM 


(X'u/Xn )<u  ■  •  •  (rm/XRU)im  ■ 


-  60  - 


*  MAX 
i  11 . J'ru 


(xu/ru)ln  ■  ■  ■  Vcm/x'su'f**\ 


where  the  and  j m  represent  choices  for  the  number  of  class  r  customers  at 
center  m.  This  is  maximized  when  all  customers  of  class  r  are  at  the  center  for 
which  class  r  has  the  largest  share  of  the  utilization  for  the  first  term,  and  when 
all  customers  are  at  the  center  with  the  smallest  share  for  the  second  term. 
(We  note  here  that  if  any  class  does  not  visit  a  center  that  is  visited  by  other 
classes  in  the  same  aggregate  then  Erm  is  unbounded).  Letting  m?  and  mf  be 
these  two  centers  respectively  for  class  r, 


E, 


R 

r,m  —  TT 
r  —  1 


X'  + 

X  - 

r,m/ 

r,mT 

X  + 

X  . 

r.m/ 

r,mT 

X'  _ 

X  + 

r,mT 

r,7nr 

X  _ 

x  + 

r.TTly. 

r.m * 

A  lower  bound  is  found  by  a  similar  process  to  be 

.  -  N. 

R 

Er.m  —  IT 

r  =  1 

This  bound  is  independent  of  both  the  class  and  center  for  which  the  error 
is  being  bounded.  For  load  independent  networks,  this  is  expected  since  the 
utilizations  of  all  classes  in  |A’(a)|  are  in  fixed  proportion.  Therefore,  an  error 
in  any  one  of  them  indicates  the  same  percentage  error  in  the  others. 


The  bound  has  two  other  nice  features.  First,  it  reduces  to  one  (i.e.,  no 
error)  if  all  classes  are  really  the  same  class.  Second,  the  bound  is  most  sensi¬ 
tive  to  the  class  with  the  largest  number  of  customers,  since  it  is  the  one  with 
the  highest  exponent  in  the  bound.  This  agrees  with  our  intuition,  since  a  single 
class  with  many  customers  will  tend  to  dominate  the  performance  measures. 


Although  the  above  formulae  bound  AV.m,  they  are  very  crude  bounds  in 
cases  where  the  classes  being  aggregated  are  substantially  different.  We  next 
derive  bounds  for  load  independent  networks  that  are  somewhat  tighter  in  gen¬ 
eral,  although  they  do  not  reduce  to  one  in  the  case  of  aggregation  of  identical 
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classes. 

Consider  aggregating  all  R  classes  of  a  load  independent  network  QN  into  a 
single  class  with  ft  =  XX  customers,  and  solving  the  resulting  model,  QN.  A 

r 

particular  choice  for  the  relative  utilization  vector  of  the  aggregate  class  is 
given  by* 

=  E  u™ 

T~  i 

The  utilization  of  center  m  by  class  r  predicted  by  Qfj ,  Ur^m,  is  then  approxi¬ 
mated  by 

ff  _  &r,m  <r> 
u  r,m  "  R 

k  =  l 

”  ^r,m 


~mr 


That  is,  it  is  the  proportion  of  the  total  utilization  of  center  m  attributed  to  class 


r  in  an  exact  solution.  Er<m  is  defined  as  before  to  be  the  ratio  of  the  approxima¬ 


tion  to  the  true  utilization.  Therefore, 


p  _  r,tn 

”  rr 

ur,m 

C(ft-i) 
~  £($) 

We  can  bound  Er  m.  by 


E 


r,m 


£  MAX 
X 


G(N-l) 
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and 
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r,m 


>  MIN 
x 


W) 


Bounds  are  easily  derived  by  considering  the  meaning  of  the  expression. 
'ls  the  expression  for  system  throughput  when  there  are  i?  customers  in 


*Any  positive  constant  times  X  is  also  a  valid  relative  utilization  vector.  We  have  chosen  this 
particular  vector  for  notational  convenience. 
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Qft.  From  asymptotic  and  bottleneck  analysis  ([Denning  and  Buzen  78])  we 
know  that  system  throughput  is  bounded  by 


%ylsmN 


1  ft 
ft  max  N 8 
1 


-MAX 


where  ft  max  i-s  the  relative  utilization  of  the  bottleneck  service  center  of  Qft,  and 


M 

AT*  _  m=1 

N  ~  T - 


MAX 

Further,  we  know  that  system  throughput  must  increase  as  we  decrease  the 
total  service  requirement  at  any  device.  System  throughput  is  therefore 
bounded  above  by  the  throughput  observed  when  all  entries  in  the  relative  utili¬ 


zation  vector  ft  are  reduced  to  ftjiiN  —  MfN'ftm 


m 


For  such  a  balanced  system  the 


throughput  is  easily  computed  as 


X 

M+ft-2 

M-l 

1 

ft 

ft  MIN 

M+ft-1 
{  M-l 

ft  MIN 

M+ft-1 

The  quotient  therefore  bounded  above  by* 


%?■ s  U!N 


1  ft 

^ MAX  N* 
%MAX 
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&MW 


ft 

M+ft-1 


A  lower  bound  for  system  throughput  can  be  calculated  by  a  similar  tech¬ 
nique.  System  throughput  must  decrease  as  the  total  service  requirement  at  a 


center  is  increased.  So,  system  throughput  is  bounded  below  by  that  of  the 

*[Au]  has  shown  that  the  throughput  of  QN  is  in  fact  bounded  above  by  the  throughput  of  a 
balanced  network  with  the  relative  utilization  at  each  center  equal  to  the  average  of  the  rela¬ 
tive  utilizations  of  QN.  This  result  can  be  incorporated  here  by  replacing  Kyis  by  XAvq,  the 
average  of  the  relative  utilizations. 
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system  with  all  entries  in  the  relative  utilization  vector  raised  to  equal  that  of 
the  bottleneck  service  center.  This  gives 


At  the  same  time,  the  sum  of  the  utilizations  of  all  devices  must  be  bounded 
below  by  one  (which  is  achieved  only  if  the  customer  population  consists  of  a  sin¬ 
gle  customer) 


M  ^ 

s* 

m*  1 
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S(i))  - 


g .. 
e<W)-  a 


m  =  l 

We  have  therefore  that 


Combining  the  upper  and  lower  bounds 
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(4.2) 


Er.m  —  MIN 
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The  inequality  expressed  in  (4.2)  shows  that  the  aggregation  procedure  is 
exact  in  The  case  where  the  network  consists  of  a  single  service  center,  since  in 
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that  case  %max  ~  1-  We  therefore  expect  the  aggregation  procedure  to  be 
asymptotically  exact  as  any  one  center  of  a  network  becomes  more  and  more  of 
a  bottleneck,  since  in  those  cases  the  network  acts  like  a  one  center  network. 
The  bounds  also  show  that  the  approximation  is  asymptotically  correct  as  the 
customer  population  $  grows. 

For  less  extreme  networks,  the  bounds  may  be  quite  loose.  For  the  exam¬ 
ple  network  listed  in  table  4.1,  the  first  set  of  bounds  give 

.111  ^  Erm  £  9.000 
while  equation  (4.2)  gives 

.923  g  Er<m  ^  1.048 

The  tighter  bounds  restrict  the  possible  error  between  an  8%  underestimate  and 
a  5%  overestimate.  The  actual  error  (shown  in  table  4.2)  is  an  underestimate  of 
about  2.3%. 

The  error  bounds  of  equation  (4.2)  can  be  improved  by  extending  the  con¬ 
cept  of  bounding  system  throughput  by  the  throughputs  of  two  balanced  net¬ 
works.  One  of  the  terms  of  the  lower  bound  of  (4.2)  is  derived  by  setting  all  rela¬ 
tive  utilizations  in  QN  to  Xuax>  the  relative  utilization  of  the  bottleneck  center. 
A  simple  formula  gives  the  throughput  of  the  resulting  model,  which  is  a  lower 
bound  on  the  throughput  of  Qft. 

We  can  trade  the  simplicity  of  the  throughput  formula  for  a  balanced  net¬ 
work  for  increased  accuracy  of  the  bound  by  assuming  that  centers  1  to  m  all 
have  relative  utilization  J£m,  where  the  centers  are  ordered  so  that  Jtj  ^  Xj+\- 
The  resulting  network  has  a  balanced  subsystem  with  m  centers,  the  solution  of 
which  is  easy  to  compute,  and  M-m  unbalanced  centers,  which  more  accurately 
reflect  the  behavior  of  QN. 

For  example,  we  can  bound  the  throughput  of  Q$  by  letting  the  relative 
utilizations  of  all  centers  but  the  bottleneck  be  AT/-U  The  throughput  of  the 
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resulting  system  is 


This  can  be  computed  in  time  0(N)  using  Buzen’s  algorithm  and  a  Cjf-i  vector 
representing  the  solution  of  the  balanced  subsystem.  Similar  computations  can 
be  carried  out  in  time  0  ((M  —m)N)  for  other  choices  of  m. 

Empirical  evidence  has  shown  that  the  class  aggregation  technique  always 
underestimates  throughputs  and  utilizations  for  load  independent  networks.  We 
expect  this  to  be  the  case  since  the  effect  of  aggregating  two  distinct  classes  is 
to  make  all  of  their  customers  statistically  identical.  This  means  that  the  custo¬ 
mers  in  the  two  classes  are  more  likely  to  collide  when  aggregated  than  when 
they  had  different  behavior  patterns.  In  appendix  B  we  prove  that  the  approxi¬ 
mation  is  an  underestimate  for  networks  with  one  customer  in  each  of  two 
classes  and  two  load  independent  service  centers.  We  conjecture  that  the  result 
holds  for  all  load  independent  networks. 

4.5  ITERATIVE  APPROXIMATION 

It  is  often  the  case  that  neither  the  absolute  utilizations  nor  the  ratios  of 
utilizations  of  devices  by  the  different  classes  are  known.  Indeed,  this  situation 
arises  whenever  one  is  doing  performance  prediction.  In  these  instances  we 
cannot  directly  apply  the  class  aggregation  procedure  described  in  the  previous 
sections  because  the  correct  ratios  of  the  utilization  vectors  are  required  to 
form  the  approximate  model  Q$ ,  and  these  are  not  known.  In  other  words, 


although  it  is  the  case  that  ™ — 

4 r.m ’  r,m' 


,  it  is  not  necessarily  true  that  — 


.  It  is  therefore  not  reasonable  to  sum  the  relative  utilization  vectors  of 


&  r\m 
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the  classes  in  each  aggregate  to  form  the  relative  utilization  vector  of  the  aggre¬ 
gate  class. 


Because  we  cannot  know  the  required  ratios  before  solving  QN,  which  is 
what  we  are  trying  to  avoid,  an  iterative  construction  of  Q$  is  used.  Each  step 
of  the  iteration  involves  a  solution  of  the  aggregate  class  model,  with  the  ratios 
of  the  relative  utilization  vectors  of  QN  chosen  according  to  information  gath¬ 
ered  from  the  previous  solution  of  Q$. 


The  iteration  is  initialized  by  setting  A\m=  X)  Wr*Xr<m,  where  (XTt  i t...,Xrjt) 

r  c  F{a.) 

is  an  arbitrary  relative  utilization  vector  for  class  r  of  QN,  and  W  is  a  vector  of 
length  R  initially  set  to  all  ones.  The  value  of  each  Wr  is  adjusted  in  successive 
steps  of  the  iteration  to  alter  the  ratios  of  the  utilization  vectors  that  compose 
each  class  of  Qft .  One  can  think  of  the  WT  as  a  vector  listing  the  system 
throughputs  of  the  R  classes  of  QN. 

At  each  step  of  the  iteration  Qft  is  solved  with  the  current  set  of  3ca ,m- 
Aggregate  class  queue  lengths  are  obtained  from  this  solution.  An  approxi¬ 
mation  for  the  mean  queue  length  of  class  r  at  center  m  in  QN  is  computed  by 


Wr*Xr 

Q  r,m  ~ 


a ,m 


(4.3) 


a,m 


where  a  =  F’(r).  We  note  that  (4.3)  is  exact  if  the  classes  in  F( a)  compose  a 
single  routing  chain,  and  center  m  is  load  independent.  It  is  also  exact  if  all 
classes  in  aggregate  class  a  are  statistically  identical  (i.e.,  have  relative  utiliza¬ 
tion  vectors  that  are  scalar  multiples  of  each  other). 

rvi 

If  we  sum  the  over  all  centers,  we  obtain  the  predicted  multiprogram¬ 
ming  level  for  class  r,  S?r,  as  observed  in  the  aggregate  class  approximation: 

rv,  X.  r\j 

X r  =  Q  r,m 

m-i 

r*j 

Nr  will  not  in  general  equal  the  true  multiprogramming  level  Nr.  We  therefore 
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adjust  the  weight  associated  with  the  relative  utilization  vector  for  class  r,  Wr,  in 
an  attempt  to  increase  or  decrease  class  r’s  predicted  multiprogramming  level. 

We  have  tried  two  different  methods  for  adjusting  the  weights,  a  discretized 
Newton  iteration  [Ortega  and  Rheinboldt  70],  and  a  heuristic.  The  discretized 
Newton  iteration  can  be  described  as  follows.  We  wish  to  find  if  such  that 

G{f)  =  F{f)~  ft  =  0 

where  ft  is  the  vector  of  multiprogramming  levels  and  F{$)  is  the  vector  of  mul¬ 
tiprogramming  levels  predicted  by  solving  the  aggregate  network  formed  using 
weights  if.  Newton  iteration  for  this  process  is 

#-{n+i)  =  #Kn)  _  /(#(«))-!(; (yf<n>) 

where  J is  the  Jacobian  of  G  evaluated  at  ff^ n\  Since  it  is  clearly  infeasible 
to  compute  J  exactly,  an  estimate  was  computed  according  to 


fs 


ggqf)< 

dWj 

G(Win\. 


1.001  . Wjn')i  -  G ( if(n) Whn)) i 

.001  W}n) 


This  requires  R  solutions  of  Qft  for  each  change  to  if,  and  so  can  be  quite  expen¬ 
sive*. 


This  iteration  scheme  converged  fairly  slowly  for  many  networks  tested.  It 
diverged  persistently  for  networks  where  two  or  more  classes  that  are  strongly 
bottlenecked  at  distinct  centers  are  aggregated.  For  these  networks  G{W)  can 
change  significantly  with  small  changes  in  if.  The  changes  made  at  each  itera¬ 
tion  tend  to  be  overestimates,  and  G{^r^)  oscillates  without  converging. 
Instead  of  implementing  more  complicated  variations  of  Newton  iteration  in  an 
attempt  to  resolve  these  problems,  we  decided  to  use  the  heuristic  method, 
which  is  both  cheaper  and  seems  to  converge  more  quickly. 


*It  is  possible  to  reduce  the  computation  required  by  re-evaluating  the  approximation  to  J 
only  every  few  iterations,  instead  of  every  iteration. 
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The  heuristic  solution  used  was 


wSn+l) 


Nr 

a  +  Y; 


W/n) 

1+a 


where  a  is  an  arbitrarily  chosen  weight.  If  a  is  chosen  to  be  zero,  (4.4)  reduces 
to  a  modification  used  by  Chandy  et  al.  [75]  in  an  iterative  approximate  solution 
technique  for  single  class,  non-separable  networks.  In  general,  we  have  used 
a  =  1  with  success.  For  most  networks  this  produces  linear  convergence,  with 
\Nr-$$n)  |  being  approximately  halved  at  each  step  of  the  iteration.  For  some 
networks  where  the  classes  forming  an  aggregate  are  heavily  bottlenecked  at 
distinct  centers,  larger  a  (e.g.,  a  =  5)  are  required  to  achieve  convergence. 
Again,  convergence  appears  to  be  linear.  The  heuristic  is  preferable  to  the 
discretized  Newton  iteration  both  because  of  its  simplicity  and  low  computa¬ 
tional  expense  at  each  step,  $nd  because  empirical  evidence  indicates  that  it 
converges  faster. 


The  convergence  criterion  used  in  our  examples  was  that  \Nr  j  <  £ 

for  all  classes  r.  This  is  perhaps  a  bit  conservative,  since  moderate  inaccuracies 
in  the  may  have  very  little  effect  on  the  predicted  performance  measures. 
An  alternative,  therefore,  is  to  iterate  until  j  Tr  -  7^n*|  <  e  for  all  r.  The  entire 
iteration  procedure  is  summarized  in  table  4.3. 

It  is  not  possible  to  apply  the  analytic  error  bounds  derived  in  section  4.5  to 
the  results  obtained  by  the  iterative  procedure,  since  we  do  not  know  the  true 
ratios  of  the  utilization  vectors.  However,  approximations  can  be  calculated  by 
assuming  that  the  ratios  of  the  utilizations  predicted  by  the  aggregate  model 
are  correct. 


Since  the  analytic  error  bounds  are  too  loose  in  general  to  be  good  indica¬ 
tions  of  the  true  error,  we  next  present  a  number  of  examples  of  solutions 
obtained  by  the  iterative  technique.  These  examples  are  followed  by  a  discus- 
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Table  4.3  -  Iterative  Class  Aggregation  Technique 

GIVEN: 

Xr<m  -  relative  utilization  of  class  r  at  center  ra. 

Nr  -  multiprogramming  level  of  class  r. 


INITIALIZE: 

rr(0)  =  1.  r  =  1,  2 . R 

#«  =  S  Nr 

r  e  F(a) 


ITERATION  STEP: 


DO  n=0  BY  1  UNTIL  |ATr  -  <  e  for  all  classes  r: 

Set  Xa,m  =  S  Wr™*Xr.m 
^  r  t  F( a) 

Solve  QN 

Compute  mean  queue  lengths  Qa,m 


Qr.m  ~ 


wSn)x< 


r.m 


@P(r). 


m 


M 


ffr  =  E 
m~l 


m 


wr(n+1) 


(X  + 


wr{n) 

1  +  a 


sion  of  the  empirical  error  behavior  of  the  approximation,  and  some  thoughts  on 
the  causes  of  the  inaccuracies. 


4.6  EXAMPLES 

Table  4.4  lists  the  results  of  applying  the  iterative  class  aggregation  tech¬ 
nique  to  a  number  of  examples.  (Tabic  4.4  also  contains  the  results  of  the  custo¬ 
mer  aggregation  technique.  These  are  discussed  in  the  next  chapter.)  In  each 
case,  we  aggregated  all  classes  into  one.  This  represents  the  cheapest,  and 
probably  least  accurate,  aggregation,  and  so  gives  a  lower  bound  on  cost  and 
upper  bound  on  error  relative  to  other  possible  aggregations. 
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Table  4.4  -  Multiple  Class  Examples 

This  table  presents  the  results  of  applying  the  class  and  customer  aggrega¬ 
tion  techniques  to  a  number  of  multiple  class  networks.  Each  network  has 
been  solved  for  a  number  of  customer  populations.  We  list  performance 
measures  for  only  those  centers  that  have  the  largest  relative  utilization  for 
at  least  one  of  the  classes.  (Some  queue  lengths  for  solutions  obtained  by 
customer  aggregation  have  been  omitted  due  to  a  software  limitation.) 

EXAMPLE  A: 


=  (  1  -5 

.5  .5  .5  .5 

.5  .5  .b 

.8  .8  .8  .8  . 

8  .8  .8  .8) 

XB  =  (.5  .5 

.5  .5  .5  .5 

.5  .5  .8 

CO 

CO 

CO 

CO 

,8.8.8  1) 

Exact  Solution 

"  Na  =  32 

Ng  =  '62 

Utilizations 

Queue  Lengths 

Device 

Class  A 

Class  B 

Total 

Class  A 

Class  B 

Total 

Cl 

.5455 

.2854 

.8109 

2.634 

1.336 

3.971 

C16 

.4364 

.5308 

.9672 

7.004 

8.299 

15.303 

Class  Aggregation  Solution 

Utilizations 

Queue  Lengths 

Device 

Class  A 

Class  B 

Total 

Class  A 

Class  B 

Total 

Cl 

.5442 

.2656 

.8098 

2.737 

1.340 

4.076 

C16 

.4354 

.5311 

.9665 

6.914 

8.462 

15.376 

Customer  Aggregation  Solution 

Utilizations 

Qu 

eue  Lengths 

Device 

Class  A 

Class  B 

Total 

Class  A 

Class  B 

Total 

Cl 

.5385 

.2619 

.8004 

C16 

.4308 

.5237 

.9545 

7.061 

8.366 

15.427 

Exact  Solution  ■ 

-  Na  =  62  Nb  =  2 

Utilizations 

Queue  Lengths 

Device 

Class  A 

Class  B 

Total 

Class  A 

Class  B 

Total 

Cl 

.9750 

.0193 

.9943 

25.660 

.535 

26.195 

C16 

.7800 

.0387 

.8187 

4.191 

.198 

4.388 

Class  Aggregation  Solution 

Utilizations 

Queue  Length 

s 

Device 

Class  A 

Class  B 

Total 

Class  A 

Class  B 

Total 

Cl 

.9749 

.0191 

.9940 

25.719 

.501 

26.220 

C16 

.7799 

.0382 

.8181 

4.191 

.204 

4.396 

Customer  Aggregation  Solution 

Utilizations 

Queue  Length 

s 

Device 

Class  A 

Class  B 

Total 

Class  A 

Class  B 

Total 

Cl 

.9587 

.0188 

.9775 

CIS 

.7669 

.0376 

.8046 

3.879 

.182 

4.061 

Exact  Solution  ■ 

-  Na  —  8  Nd  =  3 

Utilizations 

Queue  Length 

s 

Device 

Class  A 

Class  B 

Total 

Class  A 

Class  B 

Total 

Cl 

.3741 

.1863 

.5609 

.765 

.400 

1.165 

C16 

.2993 

.3736 

.6728 

.794 

.971 

1.766 
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Class  Aggregate  Solution 


Utilizations  Queue  Lengths 


Device 

Class  A 

Class  B 

Total 

Class  A 

Class  B 

Total 

Cl 

.3736 

.1867 

.5603 

.7860 

.3930 

1.1790 

C16 

.2989 

.3734 

.6723 

.7854 

.9817 

1.7671 

Customer  Aggregation  Solution 

Utilizations 

Queue  Lengths 

Device 

Class  A 

Class  B 

Total 

Class  A 

Class  B 

Total 

Cl 

.3674 

.1035 

.5509 

C16 

.2939 

.3669 

.6608 

.786 

.960 

1.746 

Exact  Solution  - 

Na  -  14  NB  =  2 

Utilizations 

Queue  Lengths 

Device 

Class  A 

Class  B 

Total 

Class  A 

Class  B 

Total 

Cl 

.6479 

.0473 

.6952 

1.773 

.137 

1.910 

C16 

.5183 

.0946 

.6129 

1.209 

.216 

1.425 

Class  Aggregate 

Solution 

Utilizations 

Queue  Lengths 

Device 

Class  A 

Class  B 

Total 

Class  A 

Class  B 

Total 

Cl 

.6480 

.0470 

.6950 

1.794 

.130 

1.924 

C16 

.5184 

.0940 

.0124 

1.207 

.218 

1.424 

Customer  Aggregate  Solution 

Utilizations 

Queue  Lengths 

Device 

Class  A 

Class  B 

Total 

Class  A 

Class  B 

Total 

Cl 

.8386 

.0460 

.6826 

C16 

.5093 

.0920 

.6013 

1.168 

.206 

1.374 

EXAMPLE  B: 

XA  —  (1  .25  .25  .25  .25  .25  .25  .25  .25  .5  .5  .5  .5  .5  .5  .5) 
XB  -  (.5  .5  .5  .5  .5  .5  .5  .5  .8  .8  .8  .8  .8  .8  .8  l) 

Exact  Solution  -  NA  =  32  NB  =  32 


Utilizations  Queue  Lengths 


Device 

Class  A 

Class  B 

Total 

Class  A 

Class  B 

Total 

Cl 

.7107 

.2919 

.9937 

14.144 

6.388 

20.533 

C16 

.3509 

.5839 

.9347 

3.923 

6.085 

10.007 

Class  Aggregate  Solution 

Device 

Utilizations 

Class  A  Class  B 

Total 

Queue  Lengths 

Class  A  Class  B  Total 

Cl 

.6931 

.2902 

.9833 

14.595 

6.128 

20.723 

C18 

.3465 

.5803 

.9269 

3.908 

6.563 

10.470 

Customer  Aggregate  Solution 

Device 

Utilizations 
Class  A  Class  B 

Total 

Queue  Lengths 

Class  A  Class  B  Total 

Cl 

.6317 

.2893 

.9210 

C16 

.3158 

.5786 

.8944 

3.392 

5.797 

9.189 

Exact  Solution  -  NA  -  62  Na  ~  2 
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Utilizations  Queue  Lengths 


Device 

Class  A 

Class  B 

Total 

Class  A 

Class  B 

Total 

Cl 

.9778 

.0224 

1.000 

52.116 

1.206 

53.322 

C 16 

.4883 

.0448 

.5336 

1.046 

.093 

1.139 

Class  Aggregate  Solution 

Utilizations 

Queue  Length 

s 

Device 

Class  A 

Class  B 

Total 

Class  A 

Class  B 

Total 

Cl 

.9778 

.0221 

.9999 

52.148 

1.171 

53.319 

C16 

.4889 

.0443 

.5332 

1.047 

.094 

1.141 

Customer  Aggregate  Solution 

Utilizations 

Queue  Lengths 

Device 

Class  A 

Class  B 

Total 

Class  A 

Class  B 

Total 

Cl 

.9611 

.0213 

.9824 

C16 

.4805 

.0425 

.5230 

1.168 

.098 

1.266 

Exact  Solution 

-  Na  =  8  Nb  =  8 

Utilizations 

Queue  Lengths 

Device 

Class  A 

Class  B 

Total 

Class  A 

Class  B 

Total 

Cl 

.5730 

.1881 

.7612 

1.690 

.619 

2.309 

C16 

.2865 

.3763 

.6628 

.750 

.956 

1.706 

Class  Aggregate  Solution 

Utilizations 

Queue  Lengths 

Device 

Class  A 

Class  B 

Total 

Class  A 

Class  B 

Total 

Cl 

.5660 

.1884 

.7544 

1.801 

.602 

2.403 

C16 

.2830 

.3767 

.6597 

.727 

.972 

1.700 

Customer  Aggregate  Solution 

Utilizations 

Queue  Lengths 

Device 

Class  A 

Class  B 

Total 

Class  A 

Class  B 

Total 

Cl 

.5266 

.1814 

.7080 

CIS 

.2633 

.3727 

.6360 

.672 

.913 

1.585 

Exact  Solution 

-  Na  =  14 

$ 

.11 

00 

Utilizations 

Queue  Lengths 

Device 

Class  A 

Class  B 

Total 

Class  A 

Class  B 

Total 

Cl 

.8937 

.0489 

.9426 

5.199 

.317 

5.512 

C10 

.4468 

.0978 

.5447 

.936 

.196 

1.132 

Class  Aggregate  Solution 

Utilizations 

Queue  Lengths 

Device 

Class  A 

Class  B 

Total 

Class  A 

Class  B 

Total 

Cl 

.8903 

.0486 

.9389 

5.262 

.288 

5.550 

C16 

.4551 

.0973 

.5424 

.928 

.203 

1.132 

Customer  Aggregate  Solution 

Utilizations 

Queue  Length 

s 

Device 

Class  A 

Class  B 

Total 

Class  A 

Class  B 

Total 

Cl 

.8112 

.0481 

.8593 

C16 

.4056 

.0962 

.5018 

.785 

.177 

.962 

EXAMPLE  C: 
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XA  =  (1  .25  .25  .25  .25  .25  .25  .25  .25  .5  .5  .5  .5  .5  .5  .5) 
XB  =  (.8  .8  .8  .8  .8  .8  .8  .8  1  1  1  1  1  1  1  l) 


Utilizations  Queue  Lengths 


Device 

Class  A 

Class  B 

Total 

Class  A 

Class  B 

Total 

Cl 

.6321 

.3675 

.9996 

19.548 

11.840 

31.388 

C13 

.3160 

.4594 

.7755 

1.387 

1.952 

3.339 

Class  Aggregate  Solution 

Utilizations 

Queue  Lengths 

Device 

Class  A 

Class  B 

Total 

Class  A 

Class  B 

Total 

Cl 

.6317 

.3667 

.9984 

19.486 

11.332 

30.818 

C16 

.3159 

.4584 

.7742 

1.389 

2.019 

3.407 

Customer  Aggregate 

Utilizations 

Queue  Lengths 

Device 

Class  A 

Class  B 

Total 

Class  A 

Class  B 

Total 

Cl 

.5907 

.3660 

.9567 

C16 

.2954 

.4575 

.7529 

1.260 

1.881 

3.141 

Exact  Solution 

-  Na  =  62  Nb  -  2 

Utilizations 

Queue  Lengths 

Device 

Class  A 

Class  B 

Total 

Class  A 

Class  B 

Total 

Cl 

.9759 

.0241 

1.000 

52.245 

1.301 

53.546 

CIS 

.4879 

.0302 

.518 

1.012 

.061 

1.074 

Class  Aggregate  Solution 

Utilizations 

Queue  Lengths 

Device 

Class  A 

Class  B 

Total 

Class  A 

Class  B 

Total 

Cl 

.9758 

.0242 

1.000 

52.221 

1.304 

53.525 

CIS 

.4879 

.0303 

.518 

1.0126 

.0632 

1.076 

Customer  Aggregate  Solution 

Utilizations 

Queue  Lengths 

Device 

Class  A 

Class  B 

Total 

Class  A 

Class  B 

Total 

Cl 

.9593 

.0238 

.9831 

C16 

.4797 

.0297 

.5094 

1.114 

.067 

1.181 

Exact  Solution 

-  Na  =  8  Nb  =  8 

Utilizations 

Queue  Lengths 

Device 

Class  A 

Class  B 

Total 

Class  A 

Class  B 

Total 

Cl 

.5720 

.2275 

.7995: 

1.879 

.826 

2.705 

CIS 

.2860 

.2844 

.5704 

.617 

.609 

1.226 

Class  Aggregate  Solution 

Utilizations 

Queue  Lengths 

Device 

Class  A 

Class  B 

Total 

Class  A 

Class  B 

Total 

Cl 

.5652 

.2279 

.7931 

1.981 

.802 

2.783 

C16 

.2826 

.2849 

.5675 

.604 

.611 

1.215 

Customer  Aggregate  Solution 

Utilizations 

Queue  Lengths 

Device 

Class  A 

Class  B 

Total 

Class  A 

Class  B 

Total 

Cl 

.5223 

.2279 

.7507 

C16 

.2614 

.2848 

.5462 

.535 

.577 

1.112 
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Exact  Solution  -  NA  =  14  NB  =  2 


Utilizations 


Queue  Lengths 


Device  Class  A  Class  B 
Cl  .8893  .0576 

C16  .4449  .0719 


Total  Class  A  Class  B  Total 
.9474  5.332  .379  5.711 

.5169  .887  .140  1.027 


Class  Aggregate  Solution 


Utilizations 


Queue  Lengths 


Device  Class  A  Class  B 
Cl  .8870  .0576 

C16  .4435  .0721 


Total  Class  A  Class  B  Total 
.9446  5.378  .351  5.729 

.5155  .860  .144  1.024 


Customer  Aggregate  Solution 


Utilizations 


Queue  Lengths 


Device  Class  A  Class  B 
Cl  .8078  .0576 

C16  .4039  .0720 


Total  Class  A  Class  B  Total 
.8654 

.4759  .737  .129  .856 


The  approximation  is  quite  accurate.  Utilizations  are  generally  -within  1.5% 
of  the  exact  solution,  and  queue  lengths  within  5%.  The  approximation  has  a  ten¬ 
dency  to  overestimate  the  queue  length  of  a  class  at  its  own  bottleneck  service 
center,  and  to  underestimate  it  at  the  bottlenecks  of  the  other  classes.  This 
means  that  the  estimate  for  the  total  queue  length  at  a  device  is  often  more 
accurate  than  the  class  dependent  queue  length  estimates. 

Figure  4.2  shows  the  error  of  the  aggregation  procedure  as  a  function  of  the 
similarity  of  the  classes  aggregated.  The  similarity  of  the  classes  is  regulated  by 
a  parameter  £,  which  is  used  to  scale  portions  of  the  relative  utilization  vectors 
of  each  class  (as  shown  in  the  figure).  For  t  —  1,  the  classes  are  identical.  For 
£=0,  the  classes  are  disjoint  (that  is,  they  do  not  have  any  service  centers  in 
common).  The  graphs  show  that  the  error  falls  off  quickly  as  the  amount  of 
interaction  between  classes  increases.  We  have  also  noted  that,  in  general,  the 
greater  error  is  observed  in  the  class  with  the  fewest  customers,  as  shown  in  the 
diagrams. 

A  typical  situation  in  which  the  aggregation  procedure  might  be  applied  is 
the  solution  of  models  of  teleprocessing  systems.  These  types  of  systems  often 
have  a  central  computing  facility  with  which  remote  users  communicate  over 
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Na  -  4  XA  =  (It  2t  2t  3  4  5) 
tfp  =  4  *£=(122  3t  4t  5t) 


% 


Error 


t 


% 

Error 


t 


Figure  4.2 
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shared  transmission  Lines.  Since  each  set  of  users  communicate  over  different 
transmission  lines,  it  is  reasonable  to  model  them  as  separate  customer  classes 
(one  class/line). 


Figure  4.3  -  Teleprocessing  Model 

Figure  4.3  shows  a  simple  instance  of  such  a  model.  There  are  four  job 
classes,  each  with  eight  customers.  Each  class  represents  the  traffic  on  a 
different  transmission  line. 

There  are  two  basic  situations  that  occur  in  such  systems.  One  is  that  the 
central  subsystem  is  the  bottleneck,  while  the  other  is  that  the  transmission 
lines  are  the  bottlenecks.  We  have  applied  the  class  aggregation  to.  the  model  of 
figure  4.3  for  both  cases,  each  time  aggregating  all  classes  into  a  single  class. 
The  parameter  settings  used,  and  the  results  obtained,  are  shown  in  table  4.5. 

Table  4.5  -  Teleprocessing  Models 

Na  =8  NB  =  8 

Nc  =  3  Nd  -  8  Both  Models 

Saturated  transmission  lines  model: 

(1  10  0  0  0)  x3  -  (1  0  10  0  0) 

Xc  =  (1  o  0  10  0)  ^  =  (1  000  10) 

Exact  Solution  -  Cost  =  131,220 
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Utilizations 


Device 

Class  A 

Class  B 

Class  C 

Class  D 

Total 

Central 

.100 

.100 

.100 

.100 

.400 

Line  A 

1.000 

.000 

.000 

.000 

1.000 

Line  B 

.000 

1.000 

.000 

.000 

1.000 

Line  C 

.000 

.000 

1.000 

.000 

1.000 

line  D 

.000 

.000 

.000 

1.000 

1.000 

Queue  Lengths 


Device 

Class  A 

Class  B 

Class  C 

Class  D 

Total 

Centred 

.187 

.167 

.167 

.167 

.667 

line  A 

7.833 

.000 

.000 

.000 

7.833 

Line  B 

.000 

7.833 

.000 

.000 

7.833 

line  C 

.000 

.000 

7.833 

.000 

7.833 

Line  D 

.000 

.000 

.000 

7.833 

7.033 

Class  Aggregation  Results  -  Cost  =  165/iteratlon  x  1  iteration 

Utilizations 


Device 

Class  A 

Class  B 

Class  C 

Class  D 

Total 

Central 

.0913 

.0913 

.0913 

.0913 

.3651 

line  A 

.9128 

.0000 

.0000 

.0000 

.9128 

line  B 

.0000 

.9128 

.0000 

.0000 

.9128 

line  C 

.0000 

.0000 

.9128 

.0000 

.9128 

Line  D 

.0000 

.0000 

.0000 

.9120 

.9128 

Queue  Lengths 


Device 

Class  A 

Class  B 

Class  C 

Class  D 

Total 

Central 

.143 

.143 

.143 

.143 

.574 

line  A 

7.057 

.000 

.000 

.000 

7.057 

line  B 

.000 

7.857 

.000 

.000 

7.057 

Line  C 

.000 

.000 

7.857 

.000 

7.057 

Line  D 

.000 

.000 

.000 

7.057 

7.057 

Saturated  central  system  model: 

XA  -  (2  5000)  Xff  =  (20500) 
^  =  (20050)  Xq  =  (2  0  0  0  5) 

Exact  Solution  -  Cost  =  131,220 

Utilizations 


Device 

Class  A 

Class  B 

Class  C 

Class  D 

Total 

Central 

.250 

.250 

.250 

.250 

1.000 

Line  A 

.625 

.000 

.000 

.000 

.625 
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Line  B 

.000 

.625 

.000 

.000 

.625 

Line  C 

.000 

.000 

.625 

.000 

.625 

Line  D 

.000 

.000 

.000 

.625 

.625 

Queue  Lengths 


Device 

Class  A 

Class  B 

Class  C 

Class  D 

Total 

Central 

6.697 

6.697 

6.697 

6.697 

26.788 

line  A 

1.303 

.000 

.000 

.000 

1.303 

Line  B 

.000 

1.303 

.000 

.000 

1.303 

line  C 

.000 

.000 

1.303 

.000 

1.303 

line  D 

.000 

.000 

.000 

1.303 

1.303 

Class  Aggregate  Solution  -  Cost  =  165/iteration  x  1  iteration 

Utilizations 


Device 

Class  A 

Class  B 

Class  C 

Class  D 

Total 

Central 

.250 

.250 

.250 

.250 

1.000 

line  A 

.625 

.000 

.000 

.000 

.625 

line  B 

.000 

.625 

.000 

.000 

.625 

line  C 

.000 

.000 

.625 

.000 

.625 

line  D 

.000 

.000 

.000 

.625 

.625 

Queue  Lengths 


Device 

Class  A 

Class  B 

Class  C 

Class  D 

Total 

Central 

6.339 

6.339 

6.339 

6.339 

25.336 

Line  A 

1.666 

.000 

.000 

.000 

1.666 

Lane  B 

.000 

1.666 

.000 

.000 

1.666 

Line  C 

.000 

.000 

1.666 

.000 

1.68Q 

Line  D 

.000 

.000 

.000 

1.666 

1.666 

The  solutions  of  these  two  models  show  that  the  class  aggregation  technique 
is  more  accurate  when  the  aggregate  classes  share  a  bottleneck  than  when  they 
have  distinct  bottlenecks.  This  is  expected,  since  they  are  more  similar  in  the 
first  case  than  in  the  second.  The  effect  of  combining  them  is  therefore  less 
severe. 


Given  a  network  in  which  the  classes  have  distinct  bottlenecks,  it  is  recom¬ 
mended  that  approximate  solutions  be  obtained  by  selecting  each  class  in  turn 
to  leave  unaggregated,  and  aggregating  the  others  as  necessary.  The  result  of 
applying  this  procedure  to  the  teleprocessing  models  is  shown  in  table  4.6.  (The 
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table  shows  the  result  for  aggregating  classes  B.C.  and  D.  The  results  for  the 
other  possible  aggregations  are  symmetric.)  The  maximum  error  for  the  unag¬ 
gregated  classes  in  utilizations  is  reduced  from  -8.7%  to  0%,  and  for  queue 
lengths  from  -14%  to  -3.8%. 


Table  4.6  -  Aggregate  (A)(BCD) 

Saturated  Transmission  lines 
Cost  =  2250 /iteration 


Utilizations 


Device 

Class  A 

Class  B 

Class  C 

Class  D 

Central 

.10 

.09 

.09 

.09 

Line  A 

1.00 

0.00 

0.00 

0.00 

Line  B 

0.00 

.92 

0.00 

0.00 

Line  C 

0.00 

0.00 

.92 

0.00 

Line  D 

0.00 

0.00 

0.00 

.92 

Queue  Lengths 


Device 

Class  A 

Class  B 

Class  C 

Class  D 

Total 

Central 

.160 

.148 

.148 

.148 

.603 

Line  A 

7.840 

.000 

.000 

.000 

7.840 

Line  B 

.000 

7.853 

.000 

.000 

7.853 

Line  C 

.000 

.000 

7.853 

.000 

7.853 

Line  D 

.000 

.000 

.000 

7.353 

7.853 

Saturated  Central  System  Model 
Cost  =  2250/iteration 


Utilizations 


Device 

Class  A 

Class  B 

Class  C 

Class  D 

Central 

.256 

248 

.248 

.248 

Line  A 

.641 

0.000 

0.000 

0.000 

line  B 

0.000 

.620 

0.000 

0.000 

Line  C 

0.000 

0.000 

.620 

0.000 

Line  D 

0.000 

0.000 

0.000 

.620 

Queue  Lengths 


Device 

Class  A 

Class  B 

Class  C 

Class  D 

Total 

Central 

6.616 

6.420 

6.420 

6.420 

25.906 

Line  A 

1.304 

.000 

.000 

.000 

1.384 

Line  B 

.000 

1.570 

.000 

.000 

1.570 

Line  C 

.000 

.000 

1.570 

.000 

1.570 

Line  D 

.000 

.000 

.000 

1.570 

1.570 

The  benefit  of  leaving  a  designated  class  unaggregated  is  shown  also  in  the 
teleprocessing  model  with  a  load  dependent  central  system  service  center.  The 
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load  dependent  rates  at  the  central  server  reflect  the  fact  that  the  central  sys¬ 
tem  is  capable  of  serving  more  than  one  job  at  a  time.  The  throughput  of  the 
system  therefore  increases  with  the  load  to  some  limit  (when  the  system  over¬ 
head  becomes  excessive,  a  device  saturates,  or  the  multiprogramming  level  lim¬ 
its  are  achieved). 


Table  4.7  -  Load  Dependent  Teleprocessing  Model 

Na  =  4  Nb  =  4  Nc  =4  Nd  =  4 
ax{n)  =  1  1.8  2.7  3.6  4.4  5.1  5.7  6.2  6.3  6.3  ... 

Saturated  Central  System 

XA  =  63000 
XB  =  6  0  3  0  0 
=  6  0  0  3  0 
=  6  0  00  3 

Exact  Solution 


Utilization  Queue  Length 


Device 

Class  A 

Class  B 

Class  C 

Class  D 

Class  A 

Class  B 

Class  C 

Class  D 

line  A 

.748 

.000 

.000 

.000 

1.519 

.000 

.000 

.000 

Line  B 

.000 

.748 

.000 

.000 

.000 

1.519 

.000 

.000 

line  C 

.000 

.000 

.748 

.000 

.000 

.000 

1.519 

.000 

Line  D 

.000 

.000 

.000 

.748 

.000 

.000 

.000 

1.519 

Central 

1.496 

1.496 

1.496 

1.496 

2.481 

2.481 

2.481 

2.481 

Aggregate  Solution  -  (ASCD) 

Utilization  Queue  Length 


Device 

Class  A 

Class  3 

Class  C 

Class  D 

Class  A 

Class  B 

Class  C 

Class  D 

line  A 

.691 

.000 

.000 

.000 

1.950 

.000 

.000 

.000 

Line  B 

.000 

.691 

.000 

.000 

.000 

1.950 

.000 

.000 

line  C 

.000 

.000 

.691 

.000 

.000 

.000 

1.950 

.000 

Line  D 

.000 

.000 

.000 

.691 

.000 

.000 

.000 

1.950 

Central 

1.383 

1.383 

1.303 

1.303 

2.050 

2.050 

2.050 

2.050 

Aggregate  Solution  -  (A)  (BCD) 


Utilization  Queue  Length 


Device 

Class  A 

Class  B 

Class  C 

Class  D 

Class  A 

Class  B 

Class  C 

Class  D 

line  A 

.777 

.000 

.000 

.000 

1.632 

.000 

.000 

.000 

Line  B 

.000 

.693 

.000 

.000 

.000 

1.859 

.000 

.000 

line  C 

.000 

.000 

.693 

.000 

.000 

.000 

1.859 

.000 

Line  D 

.000 

.000 

.000 

.693 

.000 

.000 

.000 

1.859 

Central 

1.554 

1.385 

1.385 

1.385 

2.360 

2.141 

2.141 

2.141 

Saturated  Transmission  Lines 
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XA  =23000 
XD  =20300 
Xq  =20030 
XD  =  2  0  0  0  3 


Exact  Solution 


Utilization  Queue  Length 


Device 

Class  A 

Class  B 

Class  C 

Class  D 

Class  A 

Class  B 

Class  C 

Class  D 

line  A 

.993 

.000 

.000 

.000 

3.252 

.000 

.000 

.000 

Line  B 

.000 

.993 

.000 

.000 

.000 

3.252 

.000 

.000 

Line  C 

.000 

.000 

.993 

.000 

.000 

.000 

3.252 

.000 

Line  D 

.000 

.000 

.000 

.993 

.000 

.000 

.000 

3.252 

Centred 

.662 

.662 

.662 

.662 

.748 

.748 

.748 

.748 

Aggregate  Solution 

-  (ABCD) 

Utilization 

Queue  Length 

Device 

Class  A 

Class  B 

Class  C 

Class  D 

Class  A 

Class  B 

Class  C 

Class  D 

line  A 

.817 

.000 

.000 

.000 

3.394 

.000 

.000 

.000 

Line  B 

.000 

.017 

.000 

.000 

.000 

3.394 

.000 

.000 

line  C 

.000 

.000 

.817 

.000 

.000 

.000 

3.394 

.000 

Line  D 

.000 

.000 

.000 

.817 

.000 

.000 

.000 

3.394 

Central 

.545 

.545 

.545 

.545 

.600 

.606 

.606 

.606 

Aggregate  Solution  - 

(A)  (BCD) 

Utilization 

Queue  Length 

Device 

Class  A 

Class  B 

Class  C 

Class  D 

Class  A 

Class  B 

Class  C 

Class  D 

line  A 

.993 

.000 

.000 

.000 

3.260 

.000 

.000 

.000 

Line  B 

.000 

.033 

.000 

.000 

.000 

3.379 

.000 

.000 

Line  C 

.000 

.000 

.833 

.000 

.000 

.000 

3.379 

.000 

Line  D 

.000 

.000 

.000 

.033 

.000 

.000 

.000 

3.379 

Central 

.662 

.555 

.555 

.555 

.741 

.621 

.621 

.621 

Two  sets 

of  parameter 

values 

for  the 

teleprocessing 

model 

of  figure 

using  a  load  dependent  central  system  are  listed  in  tabic  4.7;  one  set  represents 
a  saturated  central  system,  and  the  other  saturated  transmission  lines.  The 
exact  solution  and  the  results  of  aggregating  all  the  classes  and  all  but  a  single 
designated  class  are  also  shown.  For  the  saturated  transmission  line  model,  the 
two  aggregate  class  solution  reduces  the  error  of  the  single  aggregate  class 
model  from  -17.7%  to  0%  for  throughputs,  and  from  4.3%  to  .2%  for  the  queue 
length  at  the  transmission  line  (the  bottleneck). 


In  summary,  the  class  aggregation  procedure  is  most  accurate  when  the 
classes  aggregated  share  a  bottleneck.  The  accuracy  of  the  solution  can  be 
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improved  over  that  obtained  by  aggregating  all  classes  into  one  by  performing  R 
aggregations,  each  of  which  selects  a  different  class  as  one  aggregate  and  the 
remaining  classes  as  the  other.  It  is  possible  that  these  R  solutions  may  not  be 
compatible  (e.g.,  Yj^r,m  >  1  f°r  some  center  m).  However,  in  practice  this  has 

r 

not  been  a  problem,  and  so  no  special  adjustments  of  the  individual  results  are 
recommended. 


4.7  SELECTING  THE  AGGREGATES 


A  natural  question  to  ask  when  forming  the  aggregate  model  is  which 
classes  should  be  combined  into  aggregates.  The  answer  to  this  question 
depends  on  whether  one  is  interested  in  minimizing  the  total  error  over  all 
classes,  or  is  interested  primarily  in  minimizing  the  error  in  a  single  class. 


We  first  address  the  problem  of  minimizing  the  total  error  over  all  classes. 
We  assume  that  the  user  has  determined  the  amount  of  time  and  space  that  he 
is  willing  to  devote  to  the  solution  of  the  aggregate.  Our  task  is  to  select  one  of 


R 

the  g 
n  =  l 


R 

n 


possible  aggregation  schemes,  where 


r 

In 


is  a  Stirling  number  of  the 


second  kind  [Knuth  69]. 


The  intuitive  solution  to  this  problem  is  that  one  should  aggregate  those 
classes  that  are  most  alike.  In  fact,  this  is  a  very  good  heuristic,  and  is  the  basis 
of  our  partial  answer  to  the  problem.  A  difficulty  in  applying  this  intuition, 
though,  is  quantifying  the  similarity  of  two  classes. 


Our  solution  to  the  problem  of  minimizing  the  total  error  in  the  aggregation 
process  is  limited  to  addressing  the  problem  of  determining  which  classes  are 
most  similar.  The  obvious  approach  to  this  problem  is  to  compare  the  relative 
utilization  vectors  of  the  R  classes.  Since  the  vectors  contain  only  relative  infor¬ 
mation,  we  must  first  standardize  them  by  putting  them  in  some  canonical  form. 
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We  choose  to  do  this  by  assuming  arbitrarily  that  XrjjAx  =  1  for  all  classes  r, 
where  XruAx  =  MAX  [Xr>m].  Vector  differences  can  then  be  taken  as  measures  of 

771 

the  similarity  of  the  classes. 

There  are  two  problems  with  this  approach.  The  first  is  the  obvious  one  that 
a  combinatorial  number  of  differences  must  be  computed  and  used  as  the  basis 
of  the  aggregation  decisions.  The  second  problem  is  that  even  the  canonical 
representation  of  the  relative  utilization  vectors  does  not  contain  enough  infor¬ 
mation  to  decide  which  two  classes  of  three  are  most  alike.  For  example,  con¬ 
sider  classes  A,B,  and  C  with  canonical  relative  utilization  vectors 

*4=0  0  0) 

XB  =  (  1  1  1  )  (4.5) 

Xc  =  (  0  1  1  ) 

Define  the  difference  of  two  classes  r,  s,  DTiS,  to  be 

Dr,s  =  (  £  (Xr.m-Xs,m)S)i-''1 

m  =  1 

Then 

da,b  - 
Da,c  =  ^3 

&B,C  -  1 

It  would  seem  then  that  classes  B  and  C  are  most  alike.  However,  a  solution  of 
the  network  with  32  class  A  customers,  2  class  B,  and  2  class  C  shows  mean 
queue  lengths  of 

Qa  =  (32  0  0) 

Qb  =(1.78.11.11) 

Qc  =  (0  1.0  1.0) 

Clearly,  classes  A  and  B  are  most  similar,  since  the  customers  in  each  spend 
most  of  their  time  at  center  one,  while  class  C  never  visits  there.  This  contrad¬ 
icts  the  intuition  obtained  from  the  relative  utilization  vectors. 


Our  proposed  solution  to  these  problems  is  a  heuristic  that  orders  the 
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classes  such  that  each  class  is  most  reasonably  paired  with  its  predecessor  or 
successor.  To  create  the  ordering  we  require  some  information  about  absolute 
utilizations.  This  information  is  obtained  from  the  cheap,  approximate  solution 
of  the  network  obtained  by  aggregating  all  classes  into  one.  (This  is  the 
cheapest  solution  possible  using  only  class  aggregation,  and  therefore  must  be 
feasible  if  any  other  aggregations  are  feasible.)  Using  the  information  obtained 
from  this  solution,  we  order  the  absolute  utilization  vectors  of  each  class  so  that 
the  service  center  with  the  highest  total  utilization  (i.e.,  the  bottleneck)  is 
center  one,  the  next  highest  utilized  center  is  center  two,  etc.  Finally,  each  ele¬ 
ment  of  every  vector  is  divided  by  the  number  of  customers  in  the  class  to  which 
that  vector  pertains,  and  the  classes  are  sorted  according  to  the  resulting  vec¬ 
tors  (left  most  element  most  significant).  The  ordering  that  results  indicates 
that  a  class  is  most  similar  to  its  closest  neighbors. 

Having  established  this  ordering,  it  is  then  up  to  the  user  to  decide  how- 
many  aggregate  classes  to  use.  As  is  expected,  more  classes  generally  lead  to 

greater  accuracy.  The  user  might  select  natural  groupings  from  the  list  on  the 
basis  of  natural  breaking  points  that  occur  in  it.  For  instance,  it  may  be  obvious 
from  the  list  that  the  first  two  classes  are  similar  to  each  other  but  substantially 
different  from  the  rest,  and  so  should  be  grouped  separately).  Alternatively,  he 
can  decide  beforehand  what  his  limitations  in  computing  power  imply  about  the 
number  of  aggregates,  and  simply  "draw  lines"  through  the  list  to  create  the 
aggregates. 

As  an  example,  consider  the  network  described  by  (4.5)  with  32  class  A,  2 
class  B,  and  2  class  C  customers.  Our  goal  is  to  select  two  classes  to  aggregate 
in  an  aggregate  model  (one  class  is  to  be  left  unaggregated).  We  begin  by  solv¬ 
ing  the  aggregate  single  class  model.  From  this  solution  we  obtain  estimates  for 


the  absolute  utilizations  of  the  three  service  centers 
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UA  -  (  .9473  .0000  .0000) 

UB  =  (  .0527  .0527  .0527) 

Uc  ~  (  .0000  .4737  .4737) 

1.000  .5264  .5264 

We  next  divide  each  vector  by  the  number  of  customers  in  the  class,  reorder 
them  according  to  the  total  utilizations  of  the  devices,  and  sort.  This  gives 

UA  =  ( .0296  .0000  .0000  ) 

UB  -  ( .0264  .0264  .0264  ) 

Uc  =  ( .0000  .2369  .2369  ) 

This  list  clearly  shows  that  classes  A  and  B  should  be  the  ones  aggregated,  since 
they  are  most  similar.  The  results  of  all  possible  aggregations  are  shown  in 
table  4.8.  The  aggregation  in  which  classes  A  and  B  are  combined  is  the  most 
accurate  two  class  aggregation  (in  terms  of  overall  error). 


Table  4.8  -  Results  of  All  Possible  Aggregations 


Aggregation:  (ABC) 


Utilizations 


Device 

Class  A 

Class  B 

Class  C 

Total 

Cl 

.9473 

.0527 

.0000 

1.000 

C2 

.0000 

.0527 

.4737 

.5264 

C3 

.0000 

.0527 

.4737 

.5264 

Queue  Lengths 


Device 

Class  A 

Class  B 

Class  C 

Total 

Cl 

31.995 

1.782 

.000 

33.777 

C2 

.000 

.ill 

.997 

1.108 

C3 

.000 

.111 

.997 

1.108 

Aggregation:  (A)  (BC) 


Utilizations 


Device 

Class  A 

Class  B 

Class  C 

Total 

Cl 

.9474 

.0526 

.0000 

1.000 

C2 

.0000 

.0526 

.5698 

.0225 

C3 

.0000 

.0526 

.5698 

.6225 

Queue  Lengths 

Device 

Class  A 

Class  B 

Class  C 

Total 

Cl 

32.000 

1.8.16 

.000 

33.816 

C2 

.000 

.092 

1.000 

1.092 
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Aggregation:  (B)  (AC) 


Aggregation:  (C)  (A3) 


Aggregation:  (A)  (B)  (C) 


C3 

.000 

.092 

1.000 

1.092 

Utilizations 

Device 

Class  A 

Class  B 

Class  C 

Total 

Cl 

.9473 

.0520 

.0000 

.9999 

C2 

.0000 

.0526 

.4737 

.5236 

C3 

.0000 

.0526 

.4737 

.5236 

Queue  Lengths 


Device 

Class  A 

Class  B 

Class  C 

Total 

Cl 

32.000 

1 .784 

.000 

33.754 

C2 

.000 

.108 

1.000 

1.108 

C3 

.000 

.100 

1.000 

1.100 

Utilization 


Device 

Class  A 

Class  B 

Class  C 

Total 

Cl 

.9472 

.0528 

.0000 

1.000 

C2 

.0000 

.0528 

.6313 

.684 

C3 

.0000 

0528 

.6313 

.684 

Queue  Lengths 


Device 

Class  A 

Class  B 

Class  C 

Total 

Cl 

31.986 

1.790 

.000 

33.776 

C2 

.000 

.112 

1  000 

1.112 

C3 

.000 

.112 

1.000 

1.112 

Device 

Class  A 

Utilization 

Class  B 

Class  C 

Total 

Cl 

.9474 

.0526 

.0000 

1.000 

C2 

.0000 

.0526 

.6318 

.684 

C3 

.0000 

.0526 

.6310 

.684 

Device 

Queue  Lengths 

Class  A  Class  B  Class  C 

Total 

Cl 

32.000 

1.784 

.000 

33.784 

C2 

.000 

.108 

1.000 

1.108 

C3 

.000 

.108 

1.000 

1.108 

The  procedure  described  above  gives  a  way  to  select  aggregates  in  an 

attempt  to  minimize  total  error.  However,  it  might  be  the  case  that  one  is  pri¬ 
marily  interested  in  statistics  about  a  particular  class,  and  is  not  as  concerned 
about  the  other  classes.  In  these  cases  the  recommended  procedure  is  not  to 
aggregate  the  class  of  interest  with  any  others,  but  to  reduce  the  complexity  of 
the  aggregate  model  as  required  by  aggregating  the  other  classes.  The  choice  of 
these  aggregates  can  be  made  to  minimize  total  error  among  them  according  to 
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the  procedure  described  earlier  in  this  section. 

Table  4.8  is  an  example  showing  that  leaving  the  designated  class  unaggre¬ 
gated  is  a  good  heuristic.  The  results  of  the  aggregations  of  the  three  class  net¬ 
work  show  that  the  most  accurate  information  about  individual  classes  is 
obtained  from  aggregating  the  two  other  classes.  In  the  ideal  case,  we  would 
like  to  obtain  exact  solutions  for  a  particular  class,  say  class  A,  using  only  a  two 
class  model.  One  of  these  two  classes  would  be  class  A,  while  the  other  would  be 
an.  aggregate  "complementary  class".  We  define  the  complementary  class  of 
class  A  to  be  the  class  of  customers  that  causes  class  A  to  have  the  same 
throughputs  and  utilizations  in  the  aggregate  network  as  in  the  full  network. 

It  is  possible  to  show  that,  in  load  independent  networks,  a  complementary 
class  always  exists  for  at  least  one  type  of  customer  class,  balanced  classes  (i.e., 
classes  with  relative  utilizations  at  all  centers  equal).  This  result  is  stated  in  the 
following  lemma: 


Lemma:  In  a  load  independent  queueing  network  model,  the  throughputs  (and 
therefore  the  utilizations)  of  A  balanced  class  are  affected  only  by  the 
total  number  of  customers  in  the  other  classes,  and  not  by  the  number 
of  other  classes  nor  the  relative  utilization  vectors  of  those  classes. 


Proof:  The  throughput  of  customers  of  balanced  class  A  in  a  load  independent 
network  is  given  by  [Reiser  and  Lavenberg  78]: 


£  ^.m 


m—1 


where 

(l+7Utf-^)) 

and  Tbniti  “ )  is  the  mean  queue  length  at  center  m  in  the  network 
with  one  class  A  customer  removed.  If  class  A  is  balanced,  XA,m  -  c 
(constant)  for  all  m.  We  have  therefore 

rn  „ _ NA _ 

*A—  u 

£  c  (  l+Tfcra  —<$a)) 

m  =  l 

= _ Na _ 

c  (M  +  tnntf-gi)) 

m~  1 
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-  N* 

c(M  +  f]Nr  -  1) 

r= 1 

The  throughput  depends  only  on  the  total  number  of  customers  in  the 
network,  and  is  independent  of  both  R  and  the  particular  values  of  the 
Nr.  Thus  the  lemma  is  proven. 

The  lemma  shows  that  we  can  adjust  the  relative  utilization  vector  of  the  com¬ 
plementary  class  as  we  like  without  affecting  the  utilizations  of  class  A.  There¬ 
fore,  any  class  with  the  correct  number  of  customers  is  a  complementary  class 
for  class  A. 

This  result  encourages  us  to  believe  that  complementary  classes  exist  for 
all  classes.  However,  at  present  this  is  still  a  conjecture. 

4.B  SUMMARY 

We  have  presented  a  procedure  for  obtaining  cheap,  approximate  solutions 
of  separable  queueing  network  models  by  the  aggregation  of  classes.  The 
approximation  can  be  used  on  both  load  independent  and  load  dependent 
centers.  Analytic  error  bounds  for  an  idealized  version  of  the  procedure  have 
been  developed.  As  well,  the  empirical  behavior  of  the  error  has  been  described. 
The  information  obtained  is  useful  both  as  an  indication  of  the  accuracy  of  the 
approximate  solution  technique,  and  as  a  guide  to  the  error  incurred  by  reduc¬ 
ing  measurement  data  about  a  computer  system  with  multiple  job  types  into  a 
queueing  network  model  of  that  system  having  fewer  job  types. 


CHAPTER  5  -  AGGREGATION  OF  CUSTOMERS 


In  this  chapter  we  develop  an  approximate  solution  technique  based  on  the 
aggregation  of  customers.  As  in  the  previous  chapter,  our  goal  is  to  develop  a 
procedure  that  computes  accurate  approximations  to  the  exact  solution  at 
greatly  reduced  costs. 

Our  basic  scheme  is  to  aggregate  some  customers  of  a  given  network  QN  to 
produce  an  approximately  equivalent  network  QR.  QR  has  the  same  number  of 
centers  and  classes  as  QN,  but  has  fewer  customers  in  some  (possibly  all)  of  its 
classes.  In  a  sense,  some  customers  of  Q$  represent  many  customers  of  QN. 

The  solution  of  Q$  is  much  cheaper  than  that  of  QN  since  the  cost  of  solu- 

R 

tion  of  a  network  grows  as  RM  JJ  (NV+l)  for  load  independent  and  as 

r=l 


J~J  (A/V+l)2  for  load  dependent  networks.  If  the  number  of  customers  in 
2  r=l 


R 


class  r  of  QN  is  1  /j)T  the  number  in  QN,  the  relative  cost  of  Q$  is  1/j'T  pr  for 

r  =  1 


JR 

load  independent  networks,  and  1/JJ  Vr  ?or  load  dependent  ones.  In  the  next 

r=  1 


section  we  discuss  the  customer  aggregation  procedure  for  single  class  net¬ 
works.  These  results  are  extended  to  multiciass  networks  in  subsequent  sec¬ 
tions. 


5.1  SINGLE  CLASS  NETWORKS 

In  this  section  we  present  the  customer  aggregation  technique  for  single 
class  networks.  The  aggregation  procedure  is  particularly  useful  when  there  are 
many  customers  and  some  load  dependent  service  centers,  or  when  great  accu¬ 
racy  of  solution  is  not  needed.  This  suggests  that  the  customer  aggregation  pro¬ 
cedure  could  be  used  during  the  iteration  for  class  aggregation  to  obtain  quickly 
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a  good  estimate  for  the  weights  WT. 


5.1.1  Aggregation  Procedure 

Consider  a  single  class  network  QN  with  M  service  centers  and  N  customers. 

The  service  centers  may  be  load  dependent,  with  load  dependent  rates  for 
center  m  given  by  am(n  ),  n  = 

Our  technique  is  to  aggregate  each  group  of  p  customers  of  QN  into  a  single 
customer  of  Qft,  2^p^~.  *  The  population  of  Qft,  ft,  is  given  by 

£4 


V 


where  farl  denotes  the  smallest  integer  greater  than  x. 

The  relative  throughputs,  em,  and  service  times,  §m,  of  Qft  are  set  to 


(5.1) 


em  —  (emP 

S  rri  =  (‘S'm 

This  implies  that  the  relative  utilization  at  center  m  of  the  customers  of  Qft,  Xm, 
is 


%m  =  {Xmy  (5.2) 

The  load  dependent  rates  of  center  m  in  Qft  are  set  to 

am(n)  =  CLm(pn)*am(pn-l)*.  ..*am(pn-p  +  l)  n  =  l,...,ft  (5.3) 

The  aggregate  network  QN  defined  by  equations  5.1,  5.2,  and  5.3  has  the 

property  that  each  of  its  states  corresponds  in  a  natural  way  to  a  state  of  QN 
with  pft  customers.  Figure  5.1  shows  the  state  space  for  a  load  dependent  net¬ 
work  with  M- 2,  N= 4,  and  p=  2.  As  shown  in  the  diagram,  each  state 
3 Qft  =  (n1,n2,...,n^)  of  Qft  corresponds  to  the  state  Sqm  =  (pnll_p7i2,...,pnj/)  of 

QN.  Further,  the  ratio  of  the  equilibrium  probabilities  of  any  two  states  of  Qft  is 

*It  is  possible  to  pick  non-integer  p  (including  p< l)  and  still  apply  the  technique  about  to  be 
described.  However,  these  choices  for  p  have  the  disadvantage  that  approximations  for  po¬ 
pulations  less  than  N  may  not  be  computed  automatically  as  part  of  the  aggregation  pro¬ 
cedure. 
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equal  to  the  ratio  of  the  corresponding  states  of  QN . 


Unnormalized 

QN  Probabilities  QN 

4;0  Xf /Ai(4)  2;0 

3;  1  X?Xs/Ai(3) 

2; 2  XfX§  /At(S)At(Z)  1;  1 
1:3  XtXg  /  Az('A) 

0;4  Xi  /  A  4)  0;2 

Figure  5.1  -  Exact  and  Aggregate  State  Spaces 


The  customer  aggregation  technique  can  be  interpreted  as  an  approximate 
computation  of  the  solution  of  QN  from  knowledge  of  the  ratios  of  the  probabili¬ 
ties  of  a  subset  of  its  states.  For  example,  the  aggregation  of  two  customers  into 


one  is  really  the  examination  of  the 


U+2--1 
2 

M- 1 


states  of  QN  that  have  an  even 


number  of  customers  at  every  service  center  (see  figure  5.1  for  example).  Simi¬ 
lar,  though  smaller,  subsets  are  represented  by  aggregating  more  than  2  custo¬ 


mers  into  one. 

The  aggregate  model  Q$  can  be  solved  using  any  exact  solution  technique 
for  separable  networks.  We  assume  here  that  a  convolution  technique  has  been 
used,  and  express  the  resulting  performance  measures  in  those  terms  for  nota- 
tional  convenience.  The  throughput  of  center  m  in  Q& ,  Tm,  is  given  by 

&  _  -  1) 

Tm  -  c^wr 

and  the  utilization,  Um,  by 


om  = 


„  1) 

Xm — —  if  center  m  is  load  independent 


1 - 5 


G~(P)~  ^  center  m  is  load  dependent  /independent 
The  equilibrium  probability  of  observing  n  customers  at  center  m,  qm(n),  is 


5  mW 


Nn 

l))  (load  independent) 


V  n 
^  m 


II S  m  O') 

j  =  1 


(load  depen. /indep.) 
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Given  the  solution  to  Q$,  we  can  approximate  the  performance  measures  of 
QN  in  a  simple  manner. 

=  e?my/p  (5^) 

Cm  =  (Cm)'^  (5-5) 

Here  ?*m  and  Um  are  approximations  to  the  throughput,  and  utilization  respec¬ 
tively  of  center  m  in  QN.  It  is  also  possible  to  approximate  the  queue  length  dis¬ 
tribution  (and  therefore  the  mean  queue  length)  of  the  service  centers  in  QN. 
However,  before  considering  that  problem,  we  first  consider  the  accuracy  of 
approximations  5.4  and  5.5. 

There  are  two  assumptions  inherent  in  the  aggregation  procedure  that  have 
not  been  explicitly  stated: 


Al:  The  percentage  of  the  total  relative  probability  of  ail  states  of  QN 
examined  by  Q$  is  the  same  for  customer  populations  N  and  N—p 
(of  QN)]  i.e., 

Ca f(pft)  GM{pR-p) 


A2:  The  throughput  of  each  center  of  QN  with  population  pft  is  equal  to 
the  throughputs  with  populations  pfJ- 1,  pR- 2 . pft-p  + 1;  i.e., 

Trn(pft)=Trn(pft-l)  =  ■  ■  ■  =  Tm(pfi-p  +  l)  rn  =  l . M 

Assumptions  Al  and  A2  are  sufficient  for  equation  5.4  to  be  an  equality  (i.e., 
for  the  aggregation  procedure  to  give  exact  throughputs),  and  therefore  for 
equation  5.5  to  hold  for  load  independent  networks.  Let  Tm(N)  be  the  actual 
throughput  of  center  m  in  QN  with  population  N.  Then 


Tm(N)  =  e, 


GM(N- 1) 


0M(N) 

ByA2,  Tm(pN)  =  Tm(pfi- 1) 


Tn(pfi-p  +  l),  so 


Cu(pft-l)' 

cu{pf)~  3) 

CM(pft~p) 

n  cu(p/))  i 

Em  Cu(pfi-l) . 

m  Gtfipft-p  +  1) 

Tm(Nyr  = 
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=  ^  Cu(pfi)- 


By  Al, 


GM(pA'-p)  ' 


so 


Tm(try  =  em 


rm(N)  =  Tlf* 

which  is  approximation  5.4. 

The  utilization  of  load  independent  center  m  in  QN,  Um,  is  given  by 


Um  =  S 


m 


m 


Using  assumptions  Al  and  A2  as  above,  we  have 


um  =  sm  *  Ti(* 

=  (Sft 

=  (Sm 

= 

which  shows  equation  5.5  is  exact  for  load  independent  networks  if  assumptions 
Al  and  A2  hold. 

Mean  queue  lengths  at  load  independent  service  centers  can  be  approxi¬ 
mated  in  a  manner  similar  to  that  used  to  approximate  throughputs.  The  way  in 
which  they  are  computed  depends  on  the  type  of  service  center  involved.  For 
load  independent  centers,  the  mean  queue  length  at  center  m  is  given  by 


„  _  „  CWJV-1) 

Vm-^m  CU(N) 

where  G^+m  is  the  result  of  applying  Buzen’s  algorithm  [Buzen  73]  to  Gy  using 

the  relative  utilization  of  center  m.  An  approximation  for  Qm,  Qm,  can  be  com¬ 
puted  from  the  5#  by 


rw 

Q 


m 


X, 


m 


5ll*m  (jV-l) 


1/p  swtf) 


Here  we  are  again  using  assumptions  Al  and  A2  to  justify  the  approximation 
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GM+m(N  -1) 


l  /P 

e*+T»(tf) 


For  load  dependent  service  centers,  assumptions  similar  to  A1  and  A2  can 
be  used  to  approximate  the  queue  length  distributions.  This  is  done  using  infor¬ 
mation  obtained  from  the  queue  length  distribution  of  those  centers  in  Q$.  Let 
qm\n)  be  the  equilibrium  probability  of  observing  n  customers  at  center  m  in 
Qft,  n=Q,..  ,  We  wish  to  approximate  the  corresponding  distribution, 


7i =  0, ,  N,  for  QN. 


The  assumptions  required  are 


A3:  The  ratio  of  any  two  adjacent  entries  of  qm  is  equal  to  the  ratio  of 
the  two  corresponding  entries  of  qm;  i.e., 

5m(n)  _  qm(pn)  0  . 

7  ~~r~  —  7  7  I  \  \  71  —  U, . « , .  ,iV  1 

?7n(^  +  l)  tfmCpOl  +  l)) 

A4:  The  throughputs  of  the  complementary  subnetwork  of  queue  m  in 
QN  with  populations  kp  -i-l.kp  +2,...,  (k  +  l)p  are  equal, 
k=0, 

We  note  that  assumption  A4  is  trivially  correct  for  networks  consisting  of  two 
load  independent  service  centers.  Assumptions  A3  and  A4  are  equivalent  to  A1 
and  A2  applied  to  QN  with  center  m  removed  for  customer  populations  0  through 

p$. 

The  equilibrium  probability  of  observing  n  customers  at  center  m  of  Qft, 
gm(7i),  can  be  written  as 


m 


(n)  = 


— - ;  at: 


71=0, 


Am(n)  G\j{N) 

where  G^-m  is  the  convolution  matrix  resulting  from  solving  Q$  with  center  m 


n 


removed,  and Am{n)  =  {T am(j ).  Similarly, 

M 


Vmipri)  = 


Am{pn) 


Su-miN-pn) 

Cm(X) 


(5.6) 
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where  and  Am  are  the  corresponding  quantities  defined  for  QN. 

By  A3, 

ggg) 

xy1  e^-w(^-(n-n» 

Ara(n+1)  Su(fl) 


XC  Cu-m  (N-jm) 

Am(pw)  gtf(Af) _  n=0  „ 

^(n*1)  Cn-m(AT-p(n  +  l)) 

Am(p(n  +  1))  Cm(N) 

TT  ("1 

£m(”  +  l)  S _  jSAilI  3  Gu-nAN—pn) 

£m  5M.m(N-(.n  +  l))  X& 

&U-m(ft~n)  _  Clt-mQf-pn) 

&M-rn(N-(n  +  l))  Gu-m(N -p(n  +  l)) 

Given  the  initial  condition  G^_m(0)  =  1  for  all  centers  m,  equation  5.7 

describes  a  method  Tor  computing  subsequent  C^_m(pn ).  n  —  1,  2 . R .  Every  pth 

element  of  qm (n )  can  then  be  computed  using  equation  5.6. 


GM-m(M-p(n+l)) 

n=Q . ft-1  (5.7) 


An  approximation  for  the  queue  length  probabilities  qm(n)  for  n  that  are 
not  multiples  of  p  can  be  developed  using  assumption  A4.  A4  implies  that 

GM-mipn)  _  GM„m(pn  +  l)  __  _  __  GM- m(pn+p-l) 

Git-mipn  +  l)  ”  GM_,m(pn+2)  ~  ~  GM_m(p(n  + 1)) 

The  product  of  these  terms  is  the  inverse  of  the  right  hand  side  of  (5.7)  with  the 

variable  substitution  n  <-  (n  +  l),  so 


CM..m(j>n+k ) 

C.M  -m  (pn  +  /c  -!- 1) 

This  equation  provides  a  method  for  computing  all  the  Gm-tti  from  the  estimates 
for  Gm _m  (pn )  computed  by  (5.7): 


& M-mi 71 ) 


Gn/-m(n  +  l) 


i/p 


k=  0 . p  —  1 


G  M—m  "t  l) 

-m  ) 


1/p 


GM-m(pn+k  + 1)  =  GM-m(pn+k) 


k=0,l,...,p-l 
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An  unnormalized,  queue  length  distribution  can  be  computed  from  the 
approximate  by 

QmM  =  T% T 
Am \n  ) 

where  q'm(n )  denotes  the  unnormalized  probability  of  observing  n  customers  at 
center  m  in  (?_¥.  The  normalized  distribution  qm(n)  can  then  be  computed  as 

,  \  q'min) 

VmW  =  ~N - 

’Zq'm(j) 

i= 0 

We  note  that  the  procedure  described  above  does  not  guarantee  that  the 
sum  of  the  mean  queue  lengths  of  the  centers  of  QN  will  be  N.  Further,  we  do 
not  know  of  a  consistently  good  means  of  incorporating  that  requirement  into 
the  computation  of  each  individual  queue  length  distribution.  A  method  we  have 
used  with  success  for  mean  queue  lengths  is  to  compute  them  at  all  centers,  and 
then  normalize  so  that  they  sum  to  the  multiprogramming  level.  By  performing 
this  normalization,  systematic  errors  in  the  queue  lengths  can  be  reduced. 


5.1.2  Analytic  Error  Bounds 


As  in  chapter  4,  our  measure  of  error,  E,  is  the  ratio  of  the  approximate 
throughput  to  the  actual  throughput: 


E  s 


0m{N) 


1  /p 


/ 


Gu(tr-i) 

C.u(N) 


We  have  been  unable  to  derive  any  general  analytic  results  about  the  error 
of  this  approximation  technique.  However,  it  has  been  our  experience  that  the 
approximate  throughput  is  an  overestimate  for  allp>l,  and  that  the  amount  of 
the  overestimate  for  a  network  of  a  given  size  is  always  less  than  that  observed 
for  a  balanced  network  of  the  same  size  (i.e.,  for  one  with  a  relative  utilization 
vector  that  has  all  entries  equal).  In  figure  5.2  we  graph  the  error  as  a  function 
of  the  balancing  of  the  load  for  a  network  with  6  centers,  8  customers,  and 
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aggregations  of  2  and  4  customers  into  1.  For  each  value  L  along  the  X-axis  we 
have  solved  the  system  with  relative  utilization  vector  X  given  by 

X  -  (1  1.1  1.2  1.3  1.4  1.5)* 

For  t—  0  the  system  is  balanced,  while  for  large  t  the  system  has  a  significant 
bottleneck.  Since  all.  networks  graphed  have  the  same  number  of  customers 
and  centers,  identical  subsets  of  states  are  selected  for  each  solution  with  a 
given  aggregation  size.  The  only  variable  that  is  changing,  therefore,  is  system 
balancing. 

20 
16 

% 

12 

Error 

8 
4 

123456789 

t 


Figure  5.2  -  System  Balancing  vs.  Error 

We  note  that  the  error  is  monotoniceJly  decreasing  with  increasing  t,  and 
that  the  effect  of  the  error  is  always  to  overestimate  the  system  throughput. 
Further,  the  larger  the  aggregation  size,  the  less  accurate  the  result.  This 
behavior  has  been  true  of  all  load  independent  networks  we  have  observed.  Our 
unproven  conjecture  is  that  the  error  E  is  bounded  by 


1£E 


N/v 

\/p 

r 

M  +JV  —  1 

M+Z--1 

P 

N 

(Conjecture) 


for  all  load  independent  networks  and  aggregation  sizes  p.  The  upper  bound 
represents  the  error  observed  if  the  network  is  balanced.  Appendix  C  presents  a 
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proof  that  this  quantity  is  greater  than  one  for  all  integer  p  larger  than  one.  It 
is  also  simple  to  prove  that  E  ->  1  as  the  network  becomes  saturated.  These 
results,  and  empirical  observation,  indicate  that  the  technique  always  produces 
an  overestimate. 

Figure  5.3  shows  the  upper  bound  as  a  function  of  M  and  N  for  p=2.  The 
bound  increases  with  M  and  decreases  with  N,  that  is,  it  is  tightest  for  saturated 
systems. 


50 
40 

% 

30 

Error 

20 
10 

1  2  3  4  5  6  7 

M 

Figure  5.3  -  Conjecture  Upper  Bound 

5.1.3  Examples 

Table  5.1  shows  the  percentage  error  in  throughput  and  mean  queue  length 
of  the  aggregation  procedure  for  a  number  of  networks  and  aggregation  sizes  p. 
The  first  number  of  each  pair  in  table  5.1  is  the  percentage  error  observed  for 
the  network  under  consideration.  (The  meaning  of  the  second  entry  will  be 
explained  in  the  next  section.)  Yl e  have  chosen  networks  with  varying  numbers  of 
service  centers  and  customers.  Additionally,  we  have  varied  the  system  balance 
by  altering  the  relative  utilization  vectors  according  to  the  balance  parameter 
B.  This  is  done  by  setting  the  relative  utilization  of  the  bottleneck  service 
center  to  1  in  all  cases,  and  setting  the  others  according  to: 
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B-. 5:  Half  the  centers  at  .25,  the  remainder  at  .5. 
B-. 8:  Half  the  centers  at  .5,  the  remainder  at  .8. 
B- 1:  Half  the  centers  at  .8,  the  remainder  at  1. 

Table  5.1  -  Relative  Errors 


p  =  2  p  =  4 


N 

M 

B 

ThrouehoulQueue  Length 

Thrqu«hpiit_ 

4 

2 

.5 

.8 

-.8 

0.0 

0.0 

1.8 

-2.8 

0.0 

0.0 

-1.2 

1.2 

-4.0 

-.2 

16 

2 

.5 

0.0 

-.1 

0.0 

0.0 

0.0 

-.5 

0.0 

0.0 

-1.4 

-.4 

-4.3 

-2.5 

64 

2 

.5 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

-.5 

-.2 

-1.6 

-1.0 

4 

2 

.8 

1.9 

-.1 

0.0 

0.0 

4.5 

-.7 

0.0 

0.0 

-.1 

.4 

-.4 

0.4 

18 

2 

.8 

0.1 

-.1 

0.0 

0.0 

.2 

-.4 

0.0 

0.0 

-.8 

-.2 

-1.9 

-1.3 

64 

2 

.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

-.5 

-.4 

-1.6 

-1.5 

4 

2 

1.0 

2.1 

0.0 

0.0 

0.0 

5.1 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

16 

2 

1.0 

0.2 

0.0 

0.0 

0.0 

0.5 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

64 

2 

1.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

4 

6 

.5 

9.6 

-9.0 

23.7 

-3.9 

15.3 

-21.4 

47.2 

-11.8 

n 

6.3 

-4.2 

3.1 

18 

8 

.5 

0.0 

-3.0 

7.3 

-1.0 

0.0 

-7.4 

11.9 

-6.3 

-3.9 

-1.2 

-11.9 

-0.4 

64 

6 

.5 

0.0 

-.3 

1.6 

-.4  . 

0.0 

-.8 

2.5 

-1.3 

-1.7 

-.9 

-5.4 

-4,2 

4 

6 

.8 

17.1 

-2.8 

38.0 

-4.6 

33.7 

-8.0 

75.3 

-20.4 

.6 

1.9 

0.6 

2.1 

16 

6 

.8 

1.2 

-1.8 

16.3 

-4,2 

2.4 

-4,8 

35.6 

-13.1 

.1 

1.1 

-1.9 

-.9 

64 

6 

.8 

0.0 

-.3 

3.3 

-1.3 

0.0 

-.7 

-8.0 

-5.6 

-1.9 

-1.8 

-6.0 

-5.6 

4 

6 

1.0 

19.8 

-.5 

38.8 

-4.1 

42.0 

-.16 

85.9 

-22.9 

-.1 

0.0 

-.2 

-.1 

16 

6 

1.0 

2.6 

-.4 

15.0 

-1.7 

6.1 

-1.1 

39.0 

-5.8 

-.4 

-.3 

-1.2 

-1.1 

64 

6 

1.0 

.1 

-.1 

4.1 

-2.5 

0.4 

-.3 

11.3 

-7.0 

-.9 

-.9 

-2.7 

-2.7 
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4 

16 

.5 

47.9 

-9.9 

103.5  -9.9 

02.3 

-24.3 

212.0 

-39.8 

5.7  8.2 

9.5 

12.2 

16 

-«  r“» 

io 

.5 

2.3 

-10.6 

47.3  -7.5 

2.9 

-22.1 

87.2 

-19.0 

4.3  6.5 

-2.8 

-1.1 

64 

16 

.5 

0.0 

-1.8 

7.0  -1.4 

0.0 

-4.5 

12.4 

-3.7 

-4.2  -3.7 

-13.0 

-12.3 

4 

16 

.8 

58.5 

-3.0 

100.9  -8.4 

120.2 

-7.2 

247.2 

-40.4 

.9  1.2 

2.2 

2.5 

16 

16 

.8 

11.5 

-2.5 

50.7  -5.0 

22.6 

-6.8 

146.9 

-13.5 

4.0  4.3 

9.9 

10.2 

64 

16 

.8 

.2 

-1.7 

18.5  -5.7 

0.3 

-4.2 

46.8 

-14.8 

1.9  2.0 

0.5 

0.6 

4 

16 

1.0 

62.0 

-.6 

95.2  -7.3 

133.8 

-1.8 

246.0 

-38.5 

0.0  0.0 

-.1 

-.1 

16 

16 

1.0 

13.7 

-.5 

37.6  -1.5 

29.3 

-1.5 

105.5 

-5.3 

-.1  -.1 

-.5 

-.5 

64 

16 

1.0 

1.5 

-.3 

12.2  -2.2 

3.6 

-1.0 

34.4 

-6.1 

-.7  -.7 

-2.3 

-2.2 

P 

=  8 

P 

=  16 

N 

M 

B 

ThroughputQueue 

Length 

Throughput  Queue  Length 

16 

2 

.5 

0.0 

-1.4 

0.0 

0.0 

0.0 

-3.7 

0.0 

0.0 

-9.3 

-6.6 

-15.4 

-11.6 

64 

2 

.5 

0.0 

-.1 

0.0 

0.0 

0.0 

-.2 

0.0 

0.0 

-3.9 

-2.8 

-7.8 

-6.0 

16 

2 

.8 

0.3 

-.9 

0.0 

0.0 

0.4 

-2.3 

0.0 

0.0 

-4.0 

-4.0 

1 

O 

b 

-9.2 

64 

2 

.8 

0.0 

-.1 

0.0 

0.0 

0.0 

-.2 

0.0 

0.0 

1 

o 

-3.8 

-8.7 

-8.4 

16 

2 

1.0 

1.0 

0.0 

0.0 

0.0 

1.7 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

64 

2 

1.0 

0.1 

0.0 

0.0 

0.0 

0.2 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

16 

6 

.5 

0.0 

-12.8 

13.8 

-11.0 

0.0 

-20.8 

14.6 

-4.2 

-23.9 

-19.6 

-35.7 

-30.4 

64 

6 

.5 

0.0 

-1.6 

3.0 

-2.5 

0.0 

-2.7 

3.1 

-1.0 

-12.4 

-10.4 

-22.6 

-19.7 

CD 

6 

.8 

3.3 

-8.8 

49.5 

-28.9 

3.7 

-14.3 

57.3 

-55.2 

-9.0 

-7.0 

_oo  o 

-18.9 

64 

6 

.8 

0.0 

-1.5 

9.8 

-8.5 

0.0 

-2.6 

11.4 

-17.0 

-14.1 

-13.6 

-26.6 

-26.0 

16 

6 

1.0 

9.8 

-2.5 

70.5 

-16.3 

12.8 

-4.7 

95.1 

-53.4 

-3.3 

-3.2 

-7.0 

-6.9 

64 

6 

10 

0.7 

-.7 

22.4 

-13.9 

1.1 

-1.4 

34.8 

-23.2 
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-6.3 

-8.3 

-12.6 

-12.5 

18 

16 

.5 

2.9 

-31.7 

104.2 

-34.2 

2.9 

-39.6 

-40.9 

-39.1 

-22.5 

-20.0 

-40.9 

-39.1 

64 

16 

.5 

0.0 

-7.7 

14.6 

-7.0 

0.0 

-11.0 

15.4 

14.0 

-27.5 

-28.6 

-44.3 

-43.4 

16 

16 

.0 

30.4 

-12.6 

267.4 

-28.7 

34.1 

-19.0 

358.0 

-69.7 

14.4 

14.1 

9.4 

9.7 

64 

16 

.0 

0.3 

-7.3 

75.9 

-26.5 

0.3 

-10.5 

93.6 

-30.0 

-9.4 

-9.3 

-27.7 

-27.5 

16 

18 

1.0 

43.7 

-3.2 

213.3 

-10.5 

54.5 

-5.8 

334.1 

-60.3 

-1.6 

-1.8 

-3.6 

-3.6 

64 

16 

1.0 

6.1 

-2.0 

71.7 

-12.1 

0.3 

-3.3 

126.4 

-20*6 

-5.4 

-5.4 

-10.5 

-10.5 

Cost  (Time) 


P 


N 

M 

1 

2 

4 

8 

16 

4 

2 

10 

6 

4 

16 

2 

34 

18 

10 

0 

4 

64 

2 

130 

66 

24 

10 

10 

4 

6 

30 

10 

12 

16 

6 

102 

54 

30 

18 

12 

64 

6 

390 

190 

102 

54 

30 

4 

10 

80 

48 

32 

16 

16 

272 

144 

00 

40 

32 

64 

16 

1040 

528 

272 

144 

40 

The  approximation  is  good  for  small  p,  large  N  (relative  to  M),  or  unbal¬ 
anced  networks.  It  is  worst  when  N  is  much  less  than  M.  Fortunately,  we  can 
expect  the  user  population  to  exceed  the  number  of  significant  hardware  dev¬ 
ices  in  the  types  of  computer  systems  to  which  these  large  queueing  models 
would  be  applied.  In  these  cases,  the  error  is  usually  quite  acceptable. 

Table  5.2  lists  the  exact  and  approximate  solutions  of  a  model  with  a  load 
dependent  drum  service  center.  The  results  here  are  within  2%  for  throughputs 
and  5%  for  mean  queue  lengths. 

5.1.4  Improving  the  Accuracy  of  the  Solution 
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Table  5.2  -  Load  Dependent  Model 


M=  4 
N  =  16 


A'  =  (2  3  4  4) 

o-drumM  =  (1  1.667  2.0  2.1  2.1  ...) 


Exact  Solution  -  Cost  =  187 


Device  Utilization  Queue  Length 
CPU  .4968  .9811 

Drum  .6823  1.3805 


Approximate  Solution  -  p=2  Cost  =  72 

Device  Utilization  Queue  Length 
CPU  .4976 

Drum  .6831  1.386 

There  are  a  number  of  methods  for  improving  the  accuracy  of  the  customer 
aggregation  approximation: 


1.  Increase  the  percentage  of  the  states  of  the  full  state  space  represented  by 
the  states  of  the  aggregate  network. 

2.  Improve  upon  assumption  A1  (that  the  proportion  of  the  probability  mass  of 
the  full  state  space  represented  by  the  aggregate  network  is  constant  over 
varying  customer  populations). 

3.  Improve  upon  assumption  A2  (that  the  network  has  the  same  throughput 
for  the  p  largest  customer  populations). 

4.  Use  an  error  estimation  scheme  to  improve  upon  the  approximation. 

Each  of  these  possibilities  is  examined  in  turn  in  the  remainder  of  this  sec¬ 
tion. 


5.1. 4.1  Increasing  the  size  of  the  aggregate  state  space 
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It  is  intuitively  clear  that  if  the  aggregate  network  represents  more  of  the 
state  space  of  the  full  network,  the  approximation  will  be  more  accurate.  One 
way  to  effect  a  larger  aggregate  state  space  is  to  reduce  p.  The  examples  listed 
in  table  5.1  show  that,  as  expected,  the  approximation  is  more  accurate  is  these 
cases.  Unfortunately,  the  smaller  the  aggregation  size  p,  the  greater  the 
expense  of  the  approximation.  One  way  to  increase  the  size  of  the  aggregate 
state  space  without  greatly  affecting  the  computational  requirement  is  to  select 
aggregates  of  differing  sizes. 


To  this  point  we  have  considered  only  the  aggregation  of  a  fixed  number  of 
customers  of  QN  into  a  single  customer  of  Q$.  It  is  possible,  however,  to  gen¬ 
eralize  this  aggregation  so  that  various  numbers  of  customers  of  QN  are 
represented  as  single  customers  of  QN.  In  this  section  we  examine  one  such 
aggregation. 

If  we  aggregate  only  some  of  the  customers  of  QN  in  groups  of  p ,  and  leave 
the  others  unaggregated,  the  set  of  states  of  QN  examined  by  the  solution  of  this 
aggregation  is  a  superset  of  those  examined  by  Qft.  Let  QN'  be  the  result  of  this 
partial  aggregation.  We  assume  that  the  customer  population  of  ON'  consists  of 


N 

P 


1  aggregate  customers  and  N-p( 


N 

P 


-1)  unaggregated  customers.  The 


relative  utilization  vector  of  the  aggregated  customers,  X',  is  given  by 


x1  =  {xy 

where  X  is  the  relative  utilization  vector  of  QN.  The  unaggregated  customers  of 
QN'  have  relative  utilization  vectors  equal  to  X. 

Since  the  aggregated  and  unaggregated  customers  have  different  relative 
utilization  vectors,  they  are  in  different  classes.  It  can  therefore  be  more 
expensive  to  obtain  the  exact  solution  of  QN'  than  that  of  QN.  However,  an 
efficient  ^approximate  solution  of  QN'  can  be  obtained  using  a  modification  of 
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Buzen’s  algorithm  [Buzen  71,73].  As  in  Buzen’s  algorithm,  there  are  M  vectors 
G'm,  771  =  1,..., AT.  The  first  \N  /p\  elements  of  each  G'm  are  used  to  introduce  the 
relative  probability  of  an  aggregated  customer  being  at  center  m.  The  remain¬ 
ing  entries  are  used  for  the  unaggregated  customers.  Performance  measures 
for  QN’  are  computed  in  the  standard  way  using  G'^. 


M 


1 

2 

3 

0 

1 

1 

1 

r> 

Gji 

16 

20 

21 

4 

256 

336 

357 

6 

4096 

5440 

5797 

8 

65536 

87296 

93093 

9 

262144 

4-36736 

529829 

Figure  5.4-  Modified  Buzen’s  Algorithm 


This  procedure  is  diagrammed  in  figure  5.4  for  N- 9,  p  =  2,  and  J=(4  2  1). 
The  first  five  entries  in  G'm  are  for  the  aggregated  customers,  and  are  computed 
using  Buzen’s  algorithm  with  relative  utilization  vector  A’’ =(16  4  l).  The  last 
entry  of  each  G'm  is  for  the  unaggregated  customer.  It  is  computed  using 
Buzen's  algorithm  with  relative  utilization  vector  (4  2  1). 


9;0;0 

4;5;0 

2;4;3 

0;4;5 

B;1;0 

4;4;  1 

2;2;5 

0;2;7 

8;0;1 

4;2;3 

2;0;7 

0;0;9 

6;3;0 

4;0;5 

*0;  9;  0 

6;  2;  1 

2;7;0 

0;0;  1 

6;0;3 

2;6;  1 

0;6;3 

Figure  5.5  -  States  of  Modified  Buzen’s  Algorithm 


Figure  5.5  shows  the  states  of  QN  represented  in  the  solution  of  QN'.  These 
include  some  states  with  one  odd  queue  length.  The  particular  subset  of  these 
states  that  is  contained  in  the  solution  of  QN'  is  determined  by  the  ordering  of 
the  service  centers  in  the  computation  of  the  G'm.  In  general,  those  centers  that 
appear  near  the  beginning  of  the  computation  are  represented  in  a  large 
number  of  states  as  having  long  queue  lengths.  This  indicates  that  the  centers 
should  appear  in  the  computation  in  decreasing  order  of  relative  utilization. 
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5.1.4.2  A  better  assumption  A1 


Assumption  A1  states  that  the  proportion  of  the  probability  mass  of  network 
QN  represented  in  aggregate  network  Q$  is  the  same  for  populations  N  and 
N—p.  In  general,  this  will  not  be  the  case.  Two  different  techniques  can  be  used 
to  make  the  necessary  adjustments  so  that  A1  is  nearly  true:  an  exact  transfor¬ 
mation  of  the  normalizing  constant  of  Q$  into  that  of  QN,  and  a  heuristic 
approximation.  We  begin  by  describing  the  exact  technique. 

A  sufficient  condition  for  assumption  A1  to  hold  is  that  Gy( n)  =  G^(pn)  for 
populations  and  (This  is  also  a  necessary  condition  if  A1  is  to  hold 

for  7i  —  0, , $ ,  since  Cm( 0)  —  £a/(0)  =  1.)  However,  since  £2/(71)  *  Ci/(pn)  in  gen¬ 
eral,  we  wish  to  find  some  function  (3(QN)  such  that 

Cutjpft)  =  SM(f))  0(QAT) 

We  can  derive  /?  in  the  following  manner.  Let  be  a  state  of  QN  and 

s=(ny,..,nn)  be  a  state  of  Q$.  (Here  rim  and  nm  represent  the  number  of  custo¬ 
mers  at  center  771  in  QN  and  Q$  respectively.)  We  say  that  s  is  derived  from  s 
with  respect  to  center  1  if 


rim-P 


rn  =  2,...,M 


M 


m  =  N-  Y, 


n 


m 


m= 2 


Intuitively,  s  is  the  unique  state  of  Q$  from  which  s  can  be  derived  simply  by 
moving  customers  from  center  1  to  the  other  centers. 


Let  Sj  be  the  set  of  states  of  QN  that  are  derivable  from  states  of  Q$  by 
moving  exactly  j  customers  from  center  1.  For  load  independent  QN,  the  unnor¬ 
malized  probability  of  states  Sj  is 


Ca 


■jk  £  P.ff) 

A  szj/v  -1 


j/p 

where  the  summation  is  taken  over  all  9  with  more  than 


P 


(5.6) 

—  1  customers  at 
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center  i<  P(a)  is  the  unnorm aiized  probability  of  state  s,  and 


-  E  & 

Ifij  «■! 


Here  the  summation  is  taken  over  all  distinct  choices  (with  repetitions)  of  j 
centers  from  centers  2,  such  that  no  center  is  chosen  more  than  p- 1 

times.  The  summation  of  (5.8)  can  be  rewritten  as 

h/jy  i 


k  =0 


(5.9) 


Where  Pi(k)  is  the  equilibrium  probability  of  observing  k  customers  at  center  1 
in  QN.  Since  Gy(N)  is  the  sum  of  the  unnormalized  probabilities  of  all  states  of 
QN, 


-  s  It 

from  which  we  get 


Pxik) 


n=0 


PW)=  1+  t  t  pdk) 


Lj 
3  =  1 


k  =0 


(5.10) 

for  load  independent  networks  QN.  Equation  (5.10)  can  be  extended  for  the  case 
that  center  1  is  load  dependent,  which  gives 


fS(QN)  =  1  + 


x{ 


1 
3  =  1 


2 


k- 3 /V  \p.=k-j  +  l 


n  ai(n)^i(fc) 


Similar  expressions  also  hold  for  networks  with  more  than  one  load  dependent 
service  center,  with  the  queue  length  probabilities  at  center  1  replaced  by  joint 
probabilities  of  the  load  dependent  service  centers.  Finally,  we  note  that 
simplifications  to  the  formulae  for  load  dependent  networks  are  possible  in  the 
case  of  limited  queue  dependent  behavior,  that  is,  when  0^(71)  =  a^f)  for  all 


?1  Z=l, 


Equation  (5.10)  gives  an  exact  transformation  of  C^(n)  into  0^{pn).  The 
complexity  of  the  transformation  is  determined  primarily  by  the  computation  of 
the  Cj.  These  can  be  computed  according  to  the  following  algorithm,  which  is  a 
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special  case  of  Buzen’s  load  dependent  algorithm. 


figure  5.6  -  Algorithm  for  Cy 
INITIALIZE: 

A  [rt,  2]  =  X £,  n=0,...,p  -1 

COMPUTE: 

For  m:=3  until  M  do 

For  n:=  1  until  (p  - 1  )(m  - 1)  do 

A  [n,m]  =  XJ  A  [n-j'.m-lp^ 
y=o 

^[-7  AM  vzM2 

The  value  Cy  is  then  given  by  — u'  The  algorithm  requires  time  — )  and 

-A  ^ 

space  0  ((p  —  1)(A/'  —  1))  (since  the  columns  of  A  can  be  re-used). 


Equation  (5.10)  can  be  used  to  ensure  that  assumption  A1  is  met.  This  can 
dramatically  improve  the  performance  measure  predictions.  However,  the  cost 
of  computing  the  Cy  may  be  great  for  large  networks. 

If  the  service  centers  of  QN  are  ordered  so  that  service  center  1  is  the 
bottleneck,  i.e.,  X\  ^  Xm,  then  we  can  think  of  ft  as  a  function  whose 

terms  diminish  as  j  increases.  In  particular,  it  is  true  that  if  we  expand  Cy  into 
the  terms  of  which  it  is  composed  (for  a  suitable  ordering  of  those  terms),  the 
terms  of  the  series  of  ft  thus  created  are  non-increasing.  This  suggests  that  rea¬ 
sonable  approximate  transformations  of  C/V(iV)  into  C,w(pitf)  can  be  made  by 
truncating  the  series  for  all  Cy  such  that  j  >  z,  z  <MIN((p  -l)(M -l),pN). 


We  have  performed  this  truncation  for  the  network  described  below  with 
two  customer  populations.  The  rows  for  0  Terms  correspond  to  adjusting  &m($) 
by  multiplying  by  one,  i.e.,  these  rows  correspond  to  the  basic,  unadjusted  tech¬ 
nique  described  in  section  5.1.1. 


Since  the  computation  of  the  adjustments  ft  can  be  expensive  for  large  net¬ 
works,  we  have  developed  heuristics  that  perform  the  same  task  much  more 
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X  =  (1  2  3  4  5) 


II 

5? 

# 

e(tf-D  ,e(pft-P) 

G(pft)  '  G(pft) 

Terms 

C(pAT) 

G(pft-p) 

0 

.288 

.393 

1.3645 

1 

.684 

.750 

1.097 

2 

.961 

1.000 

1.041 

3 

.997 

1.000 

1.003 

4 

1.000 

1.000 

1.000 

S2J 

II 

03 

# 

ectf-o 

e(ff-i )  c(pD-P) 

G(p$)  G(pf?) 

Terms 

G(pft) 

G{pft-p) 

0 

.231 

.249 

1.080 

1 

.640 

.655 

1.023 

2 

.927 

.939 

1.013 

3 

.994 

.995 

1.001 

4 

1.000 

1.000 

1.000 

cheaply,  although  somewhat  less  accurately.  These  heuristics  are  based  on  the 
very  simple  jS  that  results  when  the  network  is  either  very  saturated  or  bal¬ 
anced.  (Reiser  [79a]  has  also  used  heuristics  based  on  characterizing  networks 
as  either  balanced  or  saturated  to  approximate  the  solution  of  models  with 
many  chains.)  In  a  balanced  network  with  N  customers,  each  state  has  unnor¬ 
malized  probability  XN,  where  X  is  the  relative  utilization  of  all  service  centers. 
Therefore,  the  total  unnormalized  probability  mass  in  the  network  is  simply  XN 
times  the  number  of  states  in  that  network.  The  number  of  states  of  a  network 
QN  with  M  service  centers  andiV  customers,  S(M,N),  is 

Since  the  aggregate  approximation  of  a  balanced  network  is  also  a  balanced  net¬ 
work,  the  proportion  of  the  probability  mass  of  QN  included  in  Qfl  for  aggrega¬ 
tion  size  p,  F  ( M,N,p  ),  is 


F(M,N,p )  = 


Sm(~) 

_ 

Gm(N) 


(XP'f/P  S(M,N/p) 
XN  S(M,N) 


.  SIUM/P) 

S(lf,N) 

A  good  approximation  for  the  throughput  of  a  balanced  network  is  therefore 
given  by 

=  (fTm)'/P 

where  F  s  F(M,N,p)/F(U,N -p,p). 

As  a  network  becomes  more  saturated,  the  only  state  with  non-zero  asymp¬ 
totic  probability  is  that  state  in  which  all  customers  are  at  the  bottleneck  ser¬ 
vice  center.  This  state  is  represented  in  the  aggregate  network  (given  that  p 
divides  N),  so  the  ratio  of  the  probability  mass  of  QN  in  Qft  is  asymptotically 
one.  A  good  estimate  of  the  throughput  of  a  saturated  system  is  therefore 

Tm  w  Tip 

An  approximation  for  the  throughput  of  an  arbitrary  network  QN  can  be 
computed  by  linear  interpolation  between  the  two  extremes  of  network  types: 


Tm  =  (?«((! -a)+oF))Vp  (5.11) 

where  a  is  a  indicator  of  how  balanced  the  system  is  (gj=1  for  a  balanced  system, 

w=0  for  a  saturated  system,  CKcjcI  otherwise). 

Table  5.3  lists  the  approximations  obtained  using  both  the  basic  scheme 
(equation  5.4)  and  the  modification  (equation  5.11).  The  weighting  function  used 
was 


CJ 


(  2  Xrk)  ~  XSax 

m  =  1 


M-i 


E  Xm'X, 


m  +  t 


(5.12) 


m  =  l 

where  X^ax  —  MAX[Xm],  and  we  assume  that  the  Xm  have  been  sorted  so  that  X\ 

771 

is  the  largest  and  Xy  the  smallest.  The  results  using  the  interpolation  scheme 
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-  Table  5.3  -  Example  Network 

M  =  16 
N  -  16 

X  =  (.5  .5  .5  .5  .5  .5  .5  .5  .8  .8  .8  .8  .8  .8  .8  1) 
Exact  Solution  -  Cost  =  272 


System  Throughput:  .7371 

Q  ie=  2.25 


Approximate  Solution  -  p=2  Cost  =  144 

Basic  Scheme 
System  Throughput:  .8216  (11.5%) 

Q16:  1.973  (-12.3%) 


Modification 
.7077  (-4.0%) 

2.255  (.2%) 


Approximate  Solution  -  p=4  Cost  =  80 

Basic  Scheme  Modification 

System  Throughput:  .9040  (22.6%)  .6545  (-11.2%) 

Q  i6:  1.916  (-14.8%)  2.138  (-2.7%) 

are  generally  great  improvements  over  the  base  results.  The  interpolation 
scheme  fits  nicely  with  the  basic  scheme,  since  the  interpolation  is  very  accu¬ 
rate  for  balanced  systems,  while  the  basic  scheme  is  most  accurate  for 
saturated  systems.  Using  the  two  in  combination  results  in  a  more  robust 
approximation. 


5. 1.4. 3  Improving  the  throughput  assumption 

Assumption  A2  used  in  deriving  the  basic  approximation  scheme  is  that  the 
throughput  of  the  network  is  constant  for  the  p  largest  multiprogramming  levels 
T  (n ) 

(i.e.,  that  ---  ■*? - —r  =  1  for  n=N— p +  1,N -p +2,...,N -1).  This  is  clearly  an 

Am(n  +  1) 

extreme  assumption. 

At  the  other  extreme  (for  load  independent  networks)  is  the  assumption 
that  the  network  is  balanced.  Since  the  least  congestion  possible  for  a  fixed 
number  of  service  centers  and  customers  occurs  when  the  network  is  balanced, 
the  greatest  increase  in  throughput  when  introducing  an  additional  customer  is 
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observed  in  these  systems.  For  balanced  networks  we  have 

Fm(n. )  _  Om(N  ~  1)  y _ Gm  (N )  _  n  M+n 

Tm(n  + 1)  "  Gm(N)  /  Gm{N  +  1)  ~  n  +  l  i/  +n-l 

We  can  improve  the  throughput  approximation  (5.4)  by  making  use  of  the 
known  ratios  for  the  extremes.  This  is  done  by  interpolating  between  them  in 
the  same  manner  as  was  described  in  the  previous  section.  Since  the  basic 
approximation  (equation  5.4)  assumes  that  the  network  is  saturated,  the  less 
extreme  interpolated  approximation  increases  the  estimate.  This  means  that 
the  interpolation  should  not  be  used  alone  with  the  basic  estimate,  since  that  is 
already  an  overestimate.  However,  the  modification  of  equation  (5.11)  tends  to 
underestimate  throughputs,  so  combining  the  interpolation  with  it  results  in 

t 

more  accurate  estimates. 

The  interpolation  between  the  two  extremes  is  done  using  the  same  weight 
cor  as  in  the  previous  section  (equation  (5.12)).  The  interpolation  used  is 


=  (l-6v)?’m  +  CO, 


N 


M+N-l 


ft  M  +77,  —  1 

m  il 


T'~  FT 


l/p 


(5.13) 


n  -N  -p  + 1  n 

where  T’m  is  the  throughput  predicted  using  equation  (5.11).  The  first  term  of 
equation  (5.13)  represents  the  estimate  for  a  saturated  system,  and  the  second 


for  a  balanced  system. 


Table  5.4  -  Modified  Solution 

Approximate  Solution  -  p=2 

System  Throughput:  .7184  (-2.5%) 
Q  i6:  2.291(1.8%) 


Approximate  Solution  -  p=4 

System  Throughput:  .6872  (-6.8%) 

Q  i6:  2.302  (2.3%) 

Table  5.4  shows  the  result  of  applying  this  technique  to  t.he  model  of  table 
5.3.  The  error  in  predicted  throughput  is  almost  halved  in  comparison  with  the 
previous  best  estimate  (4%  to  2.5%  forp=2,  and  11%  to  7%  forp=4). 
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5. 1.4.4  Improvement  using  error  estimation 

A  standard  technique  in  numerical  analysis  is  to  use  the  approximations 
obtained  with  two  different  parameter  settings  (e.g.r  step  sizes  in  a  quadrature 
technique)  to  obtain  an  approximation  for  the  error.  This  error  approximation 
is  then  used  to  modify  the  result  with  the  intention  of  improving  the  estimate 
with  only  a  marginal  increase  in  computational  effort. 

A  similar  scheme  can  be  used  to  improve  upon  the  estimates  made  for 
throughputs  using  two  different  values  for  p.  Given  aggregation  sizes  p\  and  ^2. 
p  i  <  Pzi  we  can  compute  throughput  approximations  T*m{N\p{)  and  Tm(N  \pz). 
Their  difference,  d  =  7*m(N  \p2)—¥m(N  \p  t),  can  be  used  as  an  indicator  of  the 
error  in  7* m{N  jpi).  This  is  done  by  scaling  d  by  a  factor  that  would  be  correct  if 
the  network  were  balanced. 

Table  5.5  shows  the  results  of  this  technique  on  the  example  of  table  5.3 
using  p 2  =  2 pi.  For  this  choice  of  p  the  modified  approximation  is 

Tm(N)KTm(N\Pl)  -  ud 
where 

J^Njp)-  Tm(N) 

^  Tm(N\  2v)~  Tm(N\p) 


2  iV 

2/p 

P 

N 

2 N 

M+— — 1 
V 

M  +N  - 1 

K 

p 


M+X- 1 

P 


L'P 


2N 

P 


2  N 

M+--1 


W 


This  technique  works  quite  well,  even  for  large  p.  Since  it  is  cheaper  to 


solve  the  network  for  both  p  and  2 p  than  it  is  to  solve  it  for  only  p /2,  this  tech¬ 
nique  can  result  in  a  considerably  more  accurate  result  at  a  reduced  expense. 
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Table  5.5  -  Error  Estimation  Modification 
Using  T  from  (5.4) 


P  l 

P  2 

Cost 

%  Error 
T(N|Pl) 

%  Error 
T(N!p2) 

Estimate 

2 

4 

224 

11.5 

22.6 

2.1 

4 

8 

128 

22.6 

30.4 

8.4 

3 

16 

80 

30.4 

34.1 

18.0 

Using  T  from  (5.13) 


Pi 

Pa 

Cost 

%  Error 
T(Nip0 

%  Error 
T(NM 

Estimate 

2 

4 

224 

-2.5 

-6.8 

1.0 

4 

8 

128 

-6.0 

-12.6 

3.9 

8 

16 

80 

-12.6 

-19.0 

8.6 

5.1.5  Single  Class  Conclusions 

The  second  set  of  numbers  in  tabic  5.1  is  the  results  of  applying  the  tech¬ 
niques  described  in  sections  5. 1.4.2  and  5. 1.4.3  to  a  number  of  load  independent 
models.  The  modifications  improve  the  accuracy  of  the  approximation 
significantly.  In  all  cases  that  we  have  tried,  the  approximate  throughputs  (and 
therefore  utilizations)  obtained  using  these  modifications  have  been  slight 
underestimates.  On  the  other  hand,  the  basic  scheme  overestimates  the  same 
measures.  It  is  therefore  reasonable  to  interpret  the  two  approximations  as 
bounding  the  true  solution.  (The  exact  solution  nearly  always  lies  closer  to  the 
modified  approximation  than  to  the  basic  approximation,  however.) 


Figure  5.7  shows  the  error  in  throughput  predicted  by  the  aggregation  pro¬ 
cedure  as  a  function  of  the  customer  population  for  a  sample  network.  The  net¬ 
work  chosen  is  neither  balanced  nor  strongly  bottlenecked,  since  the  aggrega¬ 
tion  procedure  is  exact  in  those  cases.  As  expected,  the  technique  is  more 
accurate  for  larger  populations.  This  occurs  both  because  the  network  is  more 
nearly  saturated  at  high  loads,  and  because  a  fraction  of  a  large  population  can 
better  represent  the  behavior  of  the  entire  population  than  the  same  fraction  of 
a  smaller  population. 


Table  5.6  shows  the  results  of  this  technique  applied  to  a  network  with  a 
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X=  (1  2  3  ...  19  20) 


t 

Figure  5.7 

load  dependent  service  center.  The  load  dependent  center  represents  a  sector 
sorted  drum,  and  the  load  dependent  rates  reflect  the  fact  that  the  drum 
throughput  may  increase  with  the  number  of  outstanding  requests. 

The  results  of  the  approximation  with  two  different  sets  of  model  parame¬ 
ters  are  shown.  In  one  set  the  drum  is  the  bottleneck,  and  in  the  other  It  is  not. 
In  both  cases  the  throughput  estimate  (for  p=2)  is  within  1%.  The  error  in  mean 
queue  length  at  the  drum  is  more  variable,  at  .5%  for  the  first  example  and  9.5% 
for  the  second. 

5.2  MULTIPLE  CLASS  NETWORKS 

The  same  general  aggregation  procedure  that  we  used  for  single  class  net¬ 
works  can  be  applied  to  multiple  class  networks.  However,  because  the  equili¬ 
brium  state  probabilities  are  more  complicated  for  these  networks,  the  scheme 
must  be  slightly  modified.  In  this  section  we  describe  the  adjustments  neces¬ 
sary  for  multiclass  networks. 


5.2.1  The  Basic  Technique 
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Table  5.6  -  Load  Dependent  Drum  Models 

N  =  16 
M  -  6 

Q-DRUfJ  —  ( 1  1-668  2.0  2.1  2.1  ...  ) 


Drum  Bound  Model  - 

X=  (  3  2  2  1  5  9) 

Exact  Solution  -  Cost  =  221 

Device  Utilization  Queue  Length 
CPU  .5831  1.3642 

Drum  .9312  4.8008 

Approximate  Solution  -  p  =  2  Cost  =  90 

Device  Utilization  Queue  Length 

CPU  .5864 

Drum  .9335  5.058 


CPU  Bound  Model  - 

*  =  (4  1  1  2  2.5  6) 

Exact  Solution  -  Cost  =  221 

Device  Utilization  Queue  Length 

CPU  .9906  9.7952 

Drum  .8615  2.9896 

Approximate  Solution  -  p  =  2  Cost  =  90 

Device  Utilization  Queue  Length 

CPU  .9938 

Drum  .8628  3.056 

The  justification  for  the  customer  aggregation  procedure  for  single  class 
networks  is  that  the  ratios  of  states  of  Q$  are  equal  to  the  ratios  of  correspond¬ 
ing  states  in  QN.  This  property  is  achieved  simply  by  taking  the  pih  power  of  the 
relative  utilization  at  each  service  center  in  QN  as  the  relative  utilization  for 
that  center  in  QR. 

If  we  consider  a  state  of  a  multiclass  network  to  be  described  by  a  listing  of 


the  customers  (including  ordering)  at  each  queue,  the  probability  of  a  state 
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where  4  is  a  vector  representing  queue  m,  is  the  class  of  the  customer 

in  station  n  of  queue  m,  and  Am(ri)  is  a  factor  representing  possible  load  depen¬ 
dent  behavior  of  service  center  m.  Because  the  unnormalized  state  probabili¬ 
ties  have  this  simple  form,  it  is  possible  to  create  an  aggregate  network  Qfi  for 
any  multiple  class  network  QN  such  that  corresponding  states  have  identical 
ratios.  We  do  this  by  taking  the  prth  power  of  the  relative  utilization  of  class  r  at 
each  center  in  QN  as  the  relative  utilization  of  the  corresponding  class  in  Qfi, 

where  pr  is  the  aggregation  size  of  class  r.  The  load  dependent  factors  for  the 
service  centers  of  QR,  are  defined  by 

Am{nhnz,  •  •  •  ,n#)  =  Am(p  jnj, p2n 2, ...,pR nR) 

QR  can  be  solved  using  any  product  form  network  solution  technique. 


The  throughput  of  class  r  in  QN  is  estimated  from  the  throughput  of  class  r 
in  QN  by  taking  the  prth  root. 

rp  _  ^APr 

1  r.m  —  1  r.m 

Implicit  in  this  approximation  are  the  multiclass  extensions  to  A1  and  A2. 

Mean  queue  lengths  can  also  be  estimated  in  the  same  manner  as  for  single 
class  networks.  For  load  independent  service  center  m,  the  mean  queue  length 

of  class  r  is  approximated  by 


Q  r.m  ~ 


m 


i/Pr 


OAff) 


For  load  dependent  service  centers,  we  can  estimate  the  queue  length  distribu- 
tion  Gu-m  in  exactly  the  same  manner  as  was  done  for  single  class  networks.  An 
estimate  of  the  mean  queue  length  can  then  be  computed  from  this  approxi¬ 
mate  queue  length  distribution. 


Table  5.7  shows  the  result  of  this  procedure  for  a  sample  network  with  two 
classes  and  sixteen  service  centers.  (The  first  set  of  queue  lengths  in  the  table 
for  the  approximation  are  for  the  computation  from  the  queue  length  distribu- 
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Table  5.7  -  Example  Network  and  Solution 
M  =  16 

Na  =32  Nb  —  32 
Pa  =  1  Pd  -  8 

^  =  (.5.5.5.5.5.5.5.5.8.8.8.8.8.8.9  1) 
XB  =  (1.5. 5. 5. 5. 5. 5. 5. 5. 8. 8, 8. 8. 8. 8. 8) 


Exact  Solution  -  Cost  =  34,848 


Utilizations 

Device 

Class  A 

Class  B 

Total 

Cl 

.2654 

.5455 

.8109 

C16 

.5308 

.4364 

.9672 

Queue  Lengths 

Device 

Class  A 

Class  B 

Total 

C16 

8.299 

7.004 

15.303 

Approximate  Solution  -  Cost  =  5,280 


Utilizations 


Device 

Class  A 

Class  B 

Total 

Cl 

.4273 

.9030 

1.330 

(64%) 

C16 

.8546 

.7224 

1.577 

(63%) 

Queue  Lengths 

Device 

Class  A 

Class  B 

Total 

.  C16(a) 

25.72 

29.26 

54.98 

(359%) 

C16(b) 

7.09 

6.30 

13.39 

(-13%) 

tion.  The  figures  in  the  second  row  are  computed  using  the  load  independent 
formula.)  The  predicted  throughputs  are  about  65%  high  for  both  classes.  This 
large  error  occurs  because  only  a  very  small  percentage  of  the  states  of  QN  are 
represented  in  Qft  using  this  technique.  It  is  therefore  important  that  adjust¬ 
ments  similar  to  those  described  for  single  class  networks  be  made.  In  the  next 
few  subsections,  we  describe  these  adjustments  in  greater  detail. 


5.2.1. 1  Increasing  the  size  of  the  aggregate  state  space 

We  can  increase  the  size  of  the  aggregate  state  space  in  two  ways.  One  is  to 
simply  decrease  any  of  the  pr.  The  other  is  to  use  the  load  dependent  convolu¬ 
tion  algorithm  so  that  the  states  of  Qft  correspond  to  marginal  states*  of  QN. 

* 

A  marginal  state  is  a  state  with  information  about  ordering  of  customers  in  queues  removed. 
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The  technique  we  use  is  the  following.  For  each  center  m,  construct  an  R 
dimensional  relative  utilization  matrix  by 


XmW  =  -zr1. 


(  Xj  Pr^V)* 


r  =  1 


R 


Am(ft)  (p1n1)f...(pJinR)!  /Jj 

,vr 

where  ft  =  (nltri2,....njj),  O^n^ - .  The  aggregate  performance  measures  are 

Vr 


then  obtained  using  the  standard  techniques  from  the  matrix  Sy,  where 


GM=Xi*X2*  ••  • 

(Here  is  the  convolution  operator.)  We  note  that  our  construction  of  the  Xm  is 
equivalent  to  selecting  every  prth  element  of  the  relative  utilization  matrix  Xm 
(in  dimension  r)  for  center  to  in  ON.  It  is  precisely  this  property  that  ensures 
that  the  ratios  of  the  state  probabilities  are  the  same  for  both  the  original  net¬ 
work  and  the  aggregate. 

Tabic  5.8  -  Convolution  Solution 
Cost  =  1,183,744 


Utilizations 


Device 

Class  A 

Class  B 

Total 

Cl 

.2442 

.6070 

.8512 

(5%) 

C16 

.4885 

.4856 

.9741 

(.7%) 

Queue  Lengths 

Device 

Class  A 

Class  B 

Total 

C16 

15.528 

16.630 

32.158 

(210%) 

Table  5.8  lists  the  results  of  applying  this  modification  to  the  model  of  table 
5.7.  As  can  be  seen,  the  modification  is  much  more  accurate  than  the  basic 
scheme  (utilization  errors  are  about  11%,  compared  with  65%  for  the  basic 
scheme).  However,  this  accuracy  is  bought  at  the  cost  of  a  great  deal  of  addi- 

Jl  Nr2 

tional  processing,  since  the  modification  requires  time  on  the  order  of  }  j 

r- 1 

R 

while  the  basic  scheme  requires  only  {TiVr. 

r=  1 


2 
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5.2.1. 2  A  better  assumption  A1 

As  in  the  single  class  case,  the  justification  for  taking  the  prth  root  of  the 
performance  measures  of  QiV  as  estimates  of  those  of  QM  depends  on  two 
assumptions  (which  are  the  multiclass  extensions  of  A1  and  A2).  One  of  these 
assumptions  is  that  the  percentage  of  the  probability  mass  of  the  states  of  QN 
represented  in  Q$  is  constant  for  populations  N-prer  and  This  is  clearly  not 
the  case.  There  does  not  seem  to  be  a  simple  function  (3  for  multiclass  networks 
corresponding  to  (5.10)  for  single  class  networks.  Because  of  this,  we  use 
heuristic  adjustments.  As  was  the  case  for  single  class  networks,  we  can  com¬ 
pute  the  required  adjustments  for  the  two  extremes  of  network  types:  balanced 
networks  and  saturated  networks.  Here,  a  network  is  balanced  with  respect  to 
class  r  if  the  relative  utilization  of  class  r  is  constant  for  all  centers.  A  network 
is  saturated  with  respect  to  class  r  if  one  of  its  service  centers  dominates  to  the 
(near)  exclusion  of  the  others.  Given  the  adjustment  factors  required  for  these 
two  extremes,  we  can  compute  an  approximate  adjustment  factor  for  an  arbi¬ 
trary  network  by  linear  interpolation  using  some  weight  that  represents  how  bal¬ 
anced  the  network  is  (exactly  as  we  did  for  single  class  networks). 

The  particular  weight  we  used  is  simply  the  single  class  weight  applied  to 
each  class  of  the  multiclass  network  individually.  The  weight,  cjr,  of  class  r  is 
computed  as 

(  S  -*nm)  -  X?UAX 

771-1 

~  M~-\  . 

V1  Y  *Y 

/„  /  ^r.m  ^  r,  m  1 

m  =  1 

where  Xrj ^  s=  MAX  \Xrm).  The  interpolation  is  given  by 

m 

B*  _  f  ( 1- - CJr  )S  (1, # -pr Gr)  +  urS  (M, $  -pr t'r ) 

r‘m  r,m  (l-ar)S(l,tf)  +  arS(M,&) 


where 
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at-i+Xal 


s(M.rf)=  rr  n 


Y 


R  ”r  , 

n  n  i 


r  =  1 


*  =  !+■ 


AT 


This  modification  results  in  a  great  improvement  over  the  basic  scheme.  The 
approximations  obtained  for  the  network  of  table  5.7  is  shown  in  table  5.9.  Here 
the  throughput  error  has  been  reduced  from  about  65%  for  the  basic  scheme 
and  11%  for  the  marginal  state  scheme  to  approximately  3.6%. 

Table  5.9  -  Improved  Assumption  A1 

Cost  =  5,312 
Utilizations 


Devioe 

Close  A 

Class  Q 

Total 

Cl 

.244 

.572 

.816 

(a») 

C10 

.488 

.458 

.958 

(-2.2J5) 

Queue  Length 

Device 

Cleat  A 

Class  B 

Total 

C10 

3.851 

4.529 

8.180 

(-47*) 

5.2. 1.3  Improving  the  throughput  assumption 


An  assumption  analogous  to  assumption  A2  for  single  class  networks  is  used 
for  multiclass  networks  to  justify  the  performance  measure  approximations. 
For  multiclass  networks,  this  assumption  is  that  the  throughput  of  class  r  is  con¬ 
stant  for  multiprogramming  levels  Nr,  Nr—  1,  Nr-pr.  This  is  often  an  unrealis¬ 
tic  assumption,  especially  when  pr  is  large. 

We  can  improve  upon  this  assumption  by  interpolating  between  known 
adjustments  for  the  two  extremes  of  network  types:  saturated  networks  and  bal¬ 
anced  networks.  This  is  done  by 


Fr.m  =  (1  -Ur)*SAT.ADJ  +  ur*BAL.ADJ 

where  SAT.ADJ  is  the  adjustment  necessary  if  the  network  is  in  fact  saturated 
(with  respect  to  class  r),  and  BAL.ADJ  is  the  adjustment  required  for  balanced 


classes.  These  are  given  by 

BAL.ADJ  =ADJ{Tr<rnitf,M) 
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SAT.ADJ  =ADJ(Trrn.rf,  1) 
where 


ADJ(TKm,fi.K)  = 


R 


N. 


?r.m  FI 

n  =Nr  + 1  -pT 


K + 7i  +  Nr  —  1 

J  =  l.jVr 


n 


1/Pr 


AU 


/r+S^r-i 

J=1 


Table  5.10  lists  the  results  of  solving  the  example  network  using  the 
modifications  of  sections  5.2. 1.2  and  5.2. 1.3.  The  throughput  error  is  reduced  to 
about  3.6%  for  class  A  and  .6%  for  class  B. 


Table  5.10  -  Adjusted  Result 
Utilizations 

Device  Class  A  Class  D  Total 
Cl  .2742  .5031  .7773  (-4%) 

C16  .5485  .4024  .9509  (-2%) 


Queue  Lengths 

Device  Class  A  Class  B  Total 

Cl  6(a)  4.786  5.151  9.937  (-35%) 

C18(b)  7.130  5.911  13.041  (-15%) 


5.2.2  The  Adjusted  Technique 

In  this  section  we  summarize  the  recommended  aggregation  procedure  for 
multiple  class  networks.  This  technique  is  the  base  scheme  modified  by  the 
adjustments  of  section  5.2. 1.2  and  5.2. 1.3.  The  procedure  outlined  here  is  the 
one  used  for  the  examples  presented  in  the  next  section. 


GIVEN:  A  multiclass  queueing  network  QN  with  R  classes,  M  service  centers,  and 
Nr  customers  in  class  r.  The  relative  utilization  of  class  r  at  center  m 
is  Xrm.  If  center  m  is  load  dependent,  Am(fl)  is  its  load  dependent 

service  factor  for  the  customer  population  represented  by  rt. 


FORM  Q$:  Create  by  aggregating  many  customers  into  one.  For  each  class  r, 
choose  an  aggregation  size  pr.  The  multiprogramming  level  of  class  r 

Nr 


in  Qft  is  set  to 


Pr 


(ideally,  pr  should  divide  Nr).  The  relative 
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p 

utilization  of  aggregate  class  r  at  center  m  is  X The  load  depen¬ 
dent  service  factor  for  center  m  with  customer  population 
( li 71 2» •  •  • » 71/} )  ^3  1 H ' >P RUr) * 


SOLVE  Qft:  Solve  Qft  using  standard  normalizing  constant  algorithms  for  separ¬ 
able  networks.  This  results  in  a  matrix  Dy.  Estimate  the  perfor¬ 
mance  measures  of  class  r  at  center  m  in  QN  by 


(  S^r2/)  ~  X^UAX 
- 

I]  Xr,j*Xr,j+ 1 

J= 1 


=  *  1^'  n 


*  w, 

n  n 

Pr 


(t-ar)s (lj -prgr)  +  vrS  (ifj -pr4r) 
5U(R)  (i-«r)s(i,j9)  +  urs(if,i}) 


ADJ{frm,ft,K)  = 


S  K+n+  £ 

?r.m  il  - - 

n=Nr  +  \-pr 


71 


'/Pr 


Xr 


K+^Nr- 1 

i= 1 


?r.m  =  (i:-aT)ADJ(fnm,/t,  1)  +  arADJ  ft, M) 


Gfl  -  du(R)[(l-Or)ADJ(TT,m,R,  1)  +  uTA2J(Tr,m,Nvec,M) 


+  _  i=i _ 


ft  +  S 

*=i 


Cu*a(R-ir)  (t-u,*)s( l,^-er)  +  afS(Af+lR-gT) 

Gu+m0)  (l-o?)S(l,R)  +  afS(M+l,R) 
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SBm  =  ^  CUJrm0)  \(l-a*)ADJ(YmJ t  1)  +  ^ADJ(YmJ.M  + 1)] 

5.2.3  Multiple  Class  Examples 

Table  4.8  lists  the  results  of  applying  the  customer  aggregation  technique 
to  a  number  of  multiple  class  models.  The  throughput  and  queue  length  predic¬ 
tions  are  generally  within  10%  of  the  exact  solutions. 

The  technique  performs  best  for  nearly  balanced  classes  and  for  networks 
with  a  single,  nearly  saturated  bottleneck.  Much  of  the  error  of  the  technique 
results  from  the  inaccuracies  in  formulation  of  the  weight  used  in  the  interpola¬ 
tion  adjustment,  and  the  adjustment  factors  corresponding  to  perfectly  bal¬ 
anced  and  perfectly  saturated  networks.  These  inaccuracies  occur  because  our 
interpolation  scheme  relies  on  decomposition  of  classes;  i.e.,  we  consider  each 
class  independently  of  the  others.  Conceiveably  better  predictions  could  be 
made  using  more  complicated  expressions  for  the  interpolation  procedures. 

We  have  examined  the  effect  of  the  aggregation  sizes  used  by  solving  the 
model  listed  in  table  5.7  for  a  large  number  of  different  aggregations.  The 
results  (in  terms  of  percentage  error)  are  listed  below 

Here  C  is  a  measure  of  the  time  and  space  required  for  each  approximate  solu¬ 
tion  (C  =  1089  for  the  exact  solution). 

The  throughput  estimates  are  fairly  close  in  all  cases.  Even  the  solution 
using  an  aggregate  network  with  only  one  customer  in  each  class  approximates 
throughputs  within  12%.  The  approximation  is  robust  because  throughputs  are 
computed  as  ratios  of  two  adjacent  elements  of  An  error  in  one  of  the  ele¬ 
ments  is  quite  closety  reflected  by  an  error  in  the  adjacent  element.  Taking 
their  ratio  tends  to  diminish  the  effect  of  any  errors. 


-  124  - 


Class  A  Class  B 


c 

Pa 

Vb 

TPUT 

QLEN 

TPUT 

QLEN 

1089 

1 

1 

0.0 

0.0 

0.0 

0.0 

297 

4 

1 

-4.5 

4.1 

.4 

9.7 

165 

1 

8 

3.3 

-14.1 

-7.8 

-23.7 

153 

2 

4 

-.4 

-3.6 

-3.6 

-7.4 

153 

4 

2 

-4.4 

3.0 

6.9 

5.3 

81 

4 

4 

-3.6 

-3.3 

-4.1 

-3.7 

66 

32 

1 

-4.2 

-43.3 

.1 

-30.7 

18 

32 

4 

-5.6 

-49.6 

-5.3 

-41.9 

4 

32 

32 

-8.6 

-54.6 

-11.1 

-56.4 

Queue  length  estimates,  on  the  other  hand,  are  computed  by  summing  over 

all  the  elements  of  &M-m  and  dividing  by  The  error  involved  with  each 

term  of  the  summation  is  probably  larger  than  the  throughput  error,  since  the 

ratios  here  are  not  of  adjacent  elements.  This  error  is  compounded  by  summing 
R 

over  the  F[(ATr+l)  elements.  We  therefore  expect  the  queue  length  estimates  to 

r=  1 

be  less  accurate.  This  is  shown  in  the  table  above. 

5.3  SUMMARY 

In  this  chapter  we  have  presented  an  approximate  solution  technique  based 
on  the  aggregation  of  customers.  The  approximation  can  be  viewed  as  a  method 
for  obtaining  partial  information  about  some  of  the  states  of  the  full  network, 
and  using  this  information  to  estimate  performance  measures  of  that  network. 

The  procedure  is  applicable  to  both  single  and  multiple  class  networks  with 
any  combination  of  load  independent  and  load  dependent  service  centers.  It 
performs  best  for  single  class  networks  and  multiple  class  networks  with  a 
significant  bottleneck.  In  addition,  heuristic  adjustments  to  the  basic  technique 
give  good  results  for  networks  with  reasonably  balanced  workloads. 


-  125  - 


CHAPTER  6  -  AGGREGATION  OF  SERVICE  CENTERS 

In  this  chapter  we  examine  the  aggregation  of  service  centers  as  an  approx¬ 
imation  technique.  As  always,  our  goal  is  to  obtain  good  approximate  solutions 
for  separable  queueing  networks  at  reduced  expense  in  comparison  with  exact 
solution  techniques.  Our  interest  in  aggregation  of  service  centers  is  motivated 
by  the  fact  that  the  behavior  of  queueing  network  models  of  computer  systems 
is  often  determined  by  a  relative  few  of  the  total  number  of  centers  in  the  net¬ 
work.  Such  networks  correspond  to  computer  systems  with  many  hardware  dev¬ 
ices,  only  some  of  which  are  highly  utilized.  Since  the  behavior  of  these  models 
is  determined  by  only  a  few  of  their  centers,  the  error  incurred  by  replacing  the 
unimportant  subset  of  centers  by  a  single,  nearly  equivalent  center  is  small. 
Our  goal,  then,  is  to  develop  a  procedure  for  cheaply  computing  these  nearly 
equivalent  centers  in  a  way  that  reduces  the  effort  required  to  solve  the  entire 
network,  while  still  producing  reasonably  accurate  predictions. 

Section  6.1  discusses  previous  work  in  this  area,  and  presents  the  key  con¬ 
cepts.  Section  6.2  presents  the  approximation  technique,  and  section  6.3  gives 
the  results  for  a  number  of  examples.  Finally,  section  6.4  describes  a  simpler 
alternative  technique  that  may  be  used  for  some  networks. 

6.1  DECOMPOSITION  AND  FLOW  EQUIVALENCE 

In  computer  system  modeling  as  in  computer  programming,  it  is  often  the 
case  that  the  object  being  created  (the  model  or  the  program)  is  too  complex  to 
be  constructed  as  a  monolith.  Instead,  the  task  must  be  broken  down  into  more 
manageable  pieces,  with  clear  interfaces  controlling  relationships  between 
them. 

In  systems  modeling,  reduction  of  a  large  model  into  its  component  parts  is 
called  decomposition.  Decomposition  is  often  applied  wThcn  activities  in  the 
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system  are  progressing  at  substantially  different  rates.  For  instance,  consider  a 
time  sharing  system  with  a  limited  degree  of  multiprogramming.  (For  simplicity, 
we  assume  that  once  a  transaction  enters  the  multiprogramming  mix  it  remains 
there  until  completion.)  There  are  two  very  different  time  scales  in  this  system. 
One  is  represented  by  the  long  term  scheduler,  and  involves  the  admission  of 
jobs  to  the  multiprogramming  mix.  The  other  is  represented  by  the  short  term 
scheduler,  and  involves  such  events  as  the  processing  of  I/O  requests,  many  of 
which  may  happen  between  entry  and  exit  from  the  mix. 


I/O 

Figure  6.1  -  Low  Level  Model 


Because  of  this  disparity  in  time  scales,  it  is  natural  to  decompose  the 
model  of  the  system  into  two  submodels,  one  for  each  time  scale.  The  low  level 
model  (short  term  scheduler)  might  look  like  figure  6.1.  Service  centers  in  this 
model  represent  the  hardware  resources  that  transactions  in  the  multiprogram¬ 
ming  mix  contend  for.  All  customers  in  the  model  are  assumed  to  be  in  the  mix. 
Tt  is  the  function  of  the  high  level  model  to  reflect  contention  to  enter  the  mix. 

The  solution  of  the  low  level  model  yields,  among  other  performance  meas¬ 
ures,  the  system  throughput  (or  job  completion  rate)  as  a  function  of  the  custo¬ 
mer  population.  The  throughputs  obtained  for  all  possible  multiprogramming 
mixes  (i.e.,  from  one  customer  to  the  multiprogramming  limit  in  the  single  class 
case)  can  then  be  used  to  construct  the  high 


level  model. 
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Terminals  Multiprogramming  Mix 

Figure  6.2  -  High  Level  Model 

Figure  G.2  shows  a  typical  high  level  model.  The  two  service  centers 
represent  the  two  places  that  transactions  can  be  (at  this  level),  either  at  the 
terminal  (i.e,  dormant)  or  contending  for  entry  to  the  multiprogramming  mix. 
The  model  is  specified  by  the  number  of  active  terminals  (corresponding  to  the 
number  of  potentially  active  transactions),  the  average  "think”  time  or  time 
between  submission  of  transactions  from  a  terminal,  and  the  service  rates  of  the 
center  representing  the  multiprogramming  mix  as  a  function  of  the  number  of 
transactions  there.  These  rates  are  reasonably  approximated  by  the  population 
dependent  throughput  rates  obtained  from  the  solutions  of  the  low  level  model. 

Because  the  detail  of  the  central  subsystem  has  been  subsumed  in  the  high 
level  model  by  a  single  service  center  that  approximates  the  rate  at  which  cus¬ 
tomers  flow  between  the  subsystem  and  the  rest  of  the  model,  the  service 
center  is  said  to  be  approximately  flow  equivalent  to  the  subsystem  [Chandy  and 
Sauer  78,80].  Flow  equivalent  centers  are  important  tools  in  systems  modeling, 
and  are  the  basis  of  the  center  aggregation  technique  presented  in  the  next  sec¬ 
tion. 

Chandy  et  al.  [75]  showed  that  exact  flow  equivalent  centers  exist  for  any 
subset  of  the  centers  of  a  separable  network.  This  is  easily  seen  by  considering 
the  convolution  solution  technique  for  separable  networks.  Let  centers  1,2,...,S 
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be  the  service  centers  for  which  a  flow  equivalent  server  is  desired.  The  matrix 
G$  that  is  the  result  of  convolving  the  S'  relative  utilization  matrices  correspond¬ 
ing  to  the  centers  of  the  subset  is  independent  of  all  centers  not  in  the  subset. 
If  we  think  of  G$  itself  as  one  of  the  A"  matrices  used  in  the  convolution  algorithm 
to  form  CM,  it  describes  the  flow  equivalent  center  for  the  S  centers  of  the  sub¬ 
set.  It  is  an  exact  flow  equivalent  center  because  the  computation  of  Gm  (i.e., 
the  solution  of  the  network)  that  results  by  convolving  in  the  remainder  of  the 
centers  is  the  same  regardless  of  whether  one  starts  with  the  5  centers  of  the 
subsystem  or  the  flow  equivalent  center  represented  by  Gg-  Using  the  tradi¬ 
tional  explanation  of  a  flow  equivalent  service  center  as  one  that  captures  the 
throughput  rate  of  the  aggregated  subsystem  as  a  function  of  the  customer 
population  in  the  subsystem,  Gg  is  exact  because  the  ratios  of  its  adjacent  ele¬ 
ments  are  exactly  the  required  population  dependent  throughputs. 

Since  the  convolution  solution  holds  only  for  separable  networks,  it  is  not 
the  case  that  exact  flow  equivalent  centers  always  exist.  Sauer  and  Chandy  [75] 
consider  the  problem  of  obtaining  approximate  flow  equivalent  centers  for  more 
general  networks.  Vantilborgh  et  al.  [80]  examine  techniques  for  decomposing  a 
network  into  subsystems  (each  of  which  is  to  be  replaced  by  an  approximate 
flow  equivalent  center)  in  an  effort  to  minimize  the  error  caused  by  this  decom¬ 
position.  A  more  complete  overview  of  decomposition  and  flow  equivalent  tech¬ 
niques  is  given  in  [Chandy  and  Sauer  80]. 

6.2  THE  TECHNIQUE 

The  goal  of  the  aggregation  technique  is  to  compute  cheaply  an  approxima¬ 
tion  for  the  flow  equivalent  service  center  that  represents  a  subset  of  the  service 
centers  of  a  separable  queueing  network  model.  In  order  to  do  this,  reasonable 
approximations  for  the  throughputs  of  that  subset  as  a  function  of  the  customer 
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population  in  it  must  be  found.  Of  the  two  techniques  available,  class  and  custo¬ 
mer  aggregation,  the  latter  is  more  appropriate  for  this  task.  Since  the  solution 
of  a  queueing  network  using  class  aggregation  produces  results  for  only  a  single 
customer  population,  many  solutions  are  required  to  construct  the  flow 
equivalent  center.  We  therefore  restrict  our  discussion  of  this  problem  to  the 
application  of  the  customer  aggregation  technique. 

Let  S  be  the  set  of  service  centers  for  which  an  approximate  flow  equivalent 

center  is  to  be  computed.  The  exact  flow  equivalent  center  can  be  described  by 
a  matrix  Gs,  which  represents  the  convolution  solution  matrix  resulting  from 
solving  the  subsystem  S'  in  isolation  with  the  full  customer  population.  An 
approximation  to  Gs,  2s,  can  be  obtained  by  applying  the  customer  aggregation 
procedure  to  S,  This  procedure  produces  a  matrix  2$  as  the  result  of  the  con¬ 
volution  operations.  Each  element  2s(nll7i2f...ln/*)  an  approximation  to 
Osip \n 2^2, -  Approximations  for  the  other  elements  of  Gs  can  be 

computed  using  a  procedure  similar  to  that  used  to  approximate  throughputs  in 
the  customer  aggregation  technique.  There  GM(N~l)  was  estimated  from 
approximations  to  Gu{N-p)  (i.e.,  0^{N/p- 1))  and  G^(N)  (i.e.,  On(N /p)). 
Similar  techniques  can  be  used  to  approximate  allp-1  elements  of  Gu  that  fall 
between  G^(n )  and  &M(n+  1). 

The  specific  technique  used  to  create  an  approximate  flow  equivalent  center 
is  the  following.  For  each  class  r,  choose  an  aggregation  size  pr.  Create  a  net¬ 
work  5s  like  S  except  that  the  multiprogramming  level  of  class  r  in  <5  is  Nr/pr, 

and  the  relative  utilization  of  class  r  at  center  m  in  S  is  X^!m.  Solve  S  using  the 
standard  normalizing  constant  technique.  This  gives  a  matrix  0$,  which  holds 
the  normalizing  constants  for  §  for  a  number  of  customer  populations.  & s  pro- 
vides  some  information  about  Gs •  The  elements  of  Gs  must  be  scaled  since 
different  proportions  of  the  total  probability  mass  of  S  are  represented  in  the 
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elements  of  Os  aeeofdifig  to  the  otistomef  population  to  which  each  element 

corresponds.  This  adjustment  is  performed  by  linear  interpolation  between  the 
adjustments  required  for  each  of  the  two  extremes  of  network  types:  saturated 
and  balanced  networks.  The  adjustment  used  is 

^s(pini,p2nzl...,PRnR)  =  [(l-u/)fi>  (l.fl)  +  wS(M,ri)]  Gs(rt) 
where 


ft  -  (/I  j,  Tig, . . . ,  71ft  ) 

Jf-H &Jir 

S(M,fi)=  If  3 
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r  =  l 


n  *  r  -1 

n  n  { 

N.  K 


*  =  !+■ 


Pr 


and  w  is  a  weight  indicating  how  balanced  5  is.  In  our  experiments  w  has  been 


computed  as 
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,m  =  l 


i— 1  r  ,m 


_  y  2 
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eK*. 

m  =  r 


*x 

m  r+,m+ 1 


where  r+  is  the  class  with  the  largest  multiprogramming  level,  MAX  is  the 
bottleneck  service  center,  and  the  centers  are  ordered  such  that  center  1  has 
the  highest  relative  utilization  and  center  M  the  smallest  (for  class  r). 


rw 

To  this  point,  only  those  elements  of  £5  that  directly  correspond  to  ele¬ 
ments  of  Os  have  been  computed.  The  remaining  elements  are  estimated  by 
interpolating  between  "known"  elements.  This  is  done  by  ordering  the  classes  in 
ascending  pr,  and  then  interpolating  in  dimension  r  according  to  this  ordering. 
For  example,  in  a  two  class  network  with  N j  =  3,  p  1  =  3,  N2  =  4,  and  p2  =  2, 


£9(0,  0),  Gs( 0,  2),  £.9(0,  4),  £5(3,  0),  £5(3,  2),  and  £9 (3,  4)  would  be  approximated 
first  from  the  corresponding  elements  of  £5.  Then  £5(0,  l),  Gs(0,  3),  £5 ( 3,  1), 
and  £9(3,  3)  are  computed  by  interpolating  in  dimension  2..  Finally,  the  remain¬ 
ing  elements  would  be  approximated  by  interpolating  in  dimension  1. 


The  interpolation  in  dimension  r  is  performed  in  a  manner  identical  to  that 
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used  to  estimate  throughputs  in  the  customer  aggregation  technique.  First,  a 
weight  ufr  indicating  how  balanced  class  r  is  is  computed  by 


= 


(  2  Xrm)  ~  Xt.UAX 
m=l 

2 


m=l 

Then,  between  previously  estimated  elements  £5(711,712... .,7V,. ..,7i/j)  and 
^si71 1.  7i2,.",Tir+pr,...,TiR)  all  intervening  elements  are  estimated  using 


s  (7i  7lr+ = 

_ &s(n1)n2 . 7V+J-1 . nj>) _ 

(l-wr)*ADJ  {nr+j,ii,  1)  +  wr*ADJ  (nr-rj,?i,M) 
R 

where  1  ^  j  ^ pr **- 1,  N  =  2  TV-  and 
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This  adjustment  is  exactly  the  heuristic  used  for  the  customer  aggregation 
scheme  to  approximate  throughputs.  (6.1)  can  be  re-written  to  get 


TS  =  (l—wr)*ADJ (nr+j.ri,  l)  +  wr*ADJ(nr+j,n,M ) 
which  is  simply  an  approximation  for  the  throughput  of  subsystem  S  with  the 

designated  customer  population.  The  approximation  is  a  linear  interpolation 

between  the  throughputs  that  would  occur  if  one  of  the  centers  were  saturated 

or  class  r  were  balanced.  The  adjustment  of  equation  (6.1)  results  in  a  matrix 

fKJ 

Gs  that  represents  an  approximate  flow  equivalent  service  center  for  subnet¬ 
work  5. 


The  solution  of  the  entire  network  QN  can  be  computed  by  continuing  the 
normalizing  constant  algorithm  for  the  remaining  service  centers,  starting  with 
the  approximate  C5  (this  solution  would  be  exact  if  Gs  were  exactly  Gs).  The 
accuracy  of  the  solution  depends  both  on  the  number  of  centers  aggregated  and 
the  aggregation  sizes  chosen  for  the  classes.  Most  often,  the  choice  of  the 
number  of  centers  to  aggregate  is  dictated  by  the  loads  on  the  centers  (one 
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should  aggregate  the  lightly  loaded  centers).  The  choice  of  the  aggregation 
sizes  should  be  made  as  described  in  the  previous  chapter. 

6.3  EXAMPLES 

Table  6.1  shows  the  percentage  error  in  throughputs  and  mean  queue 
lengths  observed  using  this  technique  for  two  multiclass  networks.  Approximate 
solutions  have  been  obtained  by  aggregating  from  4  to  16  of  the  16  service 
centers  in  each  network.  The  aggregation  size  used  for  both  classes  in  the  first 
example  was  two,  and  in  the  second  both  classes  were  aggregated  in  groups  of 
four. 

As  the  number  of  centers  aggregated  increases,  the  approximation  gen¬ 
erally  becomes  less  accurate.  As  in  the  customer  aggregation  technique,  queue 
lengths  are  less  accurate  than  throughputs  because  the  former  involve  ratios  of 
non-adjacent  elements  of  The  throughputs  do  not  behave  monotonically 

in  the  number  of  centers  aggregated  because  of  the  adjustments  done  to  them. 
These  adjustments  do  not  track  the  exact  solution  accurately.  It  is  therefore 
possible  for  the  adjusted  solution  to  wander  near  the  exact  solution,  with  the  two 
functions  occasionally  crossing.  This  is  observed  for  class  B  throughput  in  the 
first  of  the  two  examples  of  table  6.1. 

6.4  AN  ALTERNATIVE  TECHNIQUE 

Another  type  of  center  aggregation  technique  has  been  used  in  practical 
case  studies  to  reduce  the  complexity  of  the  models  produced.  The  procedure 
followed  is  to  sum  the  relative  utilizations  of  many  very  lightly  utilized  devices 
(typically  disks)  to  form  a  single  service  center  in  the  model  representing  the 
aggregate  of  the  individual  devices. 

A  slight  improvement  to  this  technique  is  to  represent  the  aggregate  center 
by  a  delay  service  center  (i.e.,  one  with  a  NQ  service  discipline).  The  assumption 
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Tabic  G.i  -  Aggregation  of  Centers 
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required  to  justify  this  technique  is  that  very  little  queueing  will  occur  at  devices 
with  small  utilizations.  The  mean  delay  incurred  by  a  customer  at  each  such 
device  is  therefore  approximately  its  service  time  there.  The  total  delay  in  pass¬ 
ing  through  all  such  devices  is  then  roughly  the  sum  of  the  total  service  require¬ 
ments  at  them.  Intuitively,  we  expect  the  throughputs  predicted  using  a  NO 
aggregate  service  center  and  an  FCFS  center  to  bound  the  actual  throughput  of 
the  network,  since  in  one  case  the  queueing  delay  is  underestimated,  and  in  the 


other  it  is  overestimated. 
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Aggregate  relative  utilization  is  14%  of  total. 
X  =  (1  2  3  4  4  4  3(agg.)  ) 


#  Centers  Aggregated 


Aggregate  relative  utilization  is  33%  of  total. 

X=(l  2  3  444  9(agg.)  ) 

% 

Error 

3 

2  4  6  8 

#  Centers  Aggregated 

Figure  6.3 

Figure  6.3  shows  the  error  observed  using  the  NQ  technique  for  two  sample 
networks.  The  graphs  plot  relative  error  against  the  number  of  service  centers 
summed  to  form  the  aggregate.  In  each  case  the  relative  utilization  of  the 
aggregate  is  held  fixed,  and  it  is  assumed  that  all  centers  from  which  it  is  com¬ 
posed  are  utilized  equally.  As  is  expected,  the  NQ  aggregate  is  most  accurate 
when  the  individual  devices  composing  11  have  significantly  lower  utilizations 
than  the  bottleneck  center.  This  is  simply  a  reflection  of  the  fact  that  in  those 
cases  very  little  queueing  will  occur  at  the  devices  of  interest,  which  is  just  the 
assumption  made  in  forming  the  aggregate. 
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Because  no  queueing  can  occur  at  the  aggregate  center,  the  error  in  the 
technique  does  not  depend  on  the  relative  utilization  of  the  aggregate,  but  only 
on  the  individual  utilizations  of  the  devices  aggregated.  The  technique  can 
therefore  be  applied  to  combine  any  number  of  service  centers,  provided  that 
each  has  sufficiently  small  queueing  delays  associated  with  it. 


CHAPTER  7  -  COMPARISON  OF  CUSTOMER  AND  CLASS  AGGREGATION 


In  previous  chapters,  approximate  solution  techniques  based  on  aggrega¬ 
tion  of  customer  classes  and  customers  themselves  were  presented.  In  this 
chapter  the  two  techniques  are  compared  to  determine  the  relative  merits  of 
each.  (We  do  not  examine  center  aggregation  here  because  its  behavior  is  dom¬ 
inated  by  the  customer  aggregation  technique.) 

7.1  THE  APPROXIMATION 

The  starting  point  for  this  comparison  is  a  fairly  large  queueing  network 
model,  which  we  hypothesize  has  been  created  by  a  system  analyst  as  a  model  of 
a  computer  system  under  study.  We  further  hypothesize  that  the  model  is  too 
large  to  be  solved  exactly,  and  so  an  approximate  solution  technique  is  manda¬ 
tory.  The  problem  is  to  decide  which  technique  is  most  applicable. 

Table  7.1  -  Network  Description 
Na  -  15  Na  =  15  Nc  -  15 

XA  =(12  1  3  0  0  0  0  0  0  0  0  0  1  10) 

Xq  =  (0  00221  3350  0001  10) 

Xc  =  (0  0  0  0  0  0  0  0  3  3  2  1  4  G  2) 

a1 5(n)  =  (1  1.8  2.4  2.9  3.3  3.5  3.07  3.7  3.7  ...  ) 

The  network  actually  considered  is  listed  in  table  7.1.  (We  have  especially 
chosen  a  network  that  is  as  large  as  possible  if  we  are  to  compute  the  exact 
solution.  This  (very  expensive)  exact  solution  is  shown  in  table  7.2.)  There  are 
three  customer  classes,  with  fifteen  customers  in  each  class,  and  fifteen  service 
centers.  One  of  the  service  centers  is  load  dependent,  with  monotone  non¬ 
decreasing  rates  as  a  function  of  the  load.  The  three  customer  classes  are  quite 
distinct  statistically,  as  shown  by  their  routing  behavior.  All  classes  visit  centers 
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fourteen  and  fifteen,  while  class  A  additionally  visits  centers  one  to  four,  class  B 
centers  four  to  nine,  and  class  C  centers  nine  to  thirteen.  We  see  further  from 
the  relative  utilization  vectors  that  class  C  has  its  largest  service  requirement  at 
center  fourteen,  while  classes  A  and  B  each  require  the  most  service  from 
center  fifteen. 

Table  7.2  -  Exact  Solution 

Cost  -  Time:  2,269,184  Space:  4096 

ABC  Total 
System  Throughtput:  .207  .126  .110  .443 

Bottleneck  Queue  Length:  3.172  2.021  9.366  14.559 

In  choosing  between  class  and  customer  aggregation,  we  must  take  into 
consideration  the  known  error  behavior  of  each  technique.  Customer  aggrega¬ 
tion  tends  to  be  less  accurate  than  class  aggregation  for  multiple  class  net¬ 
works.  However,  claaa  aggregation  may  perform  poorly  when  aggregating  very 
dissimilar  classes,  such  as  the  three  in  this  network.  Weighing  the  strengths  of 
each,  class  aggregation  is  the  first  choice. 

Given  that  class  aggregation  has  been  selected,  we  follow  the  procedure 
recommended  in  chapter  four  for  selecting  aggregates.  The  first  step  is  to 
obtain  an  approximate  solution  by  aggregating  all  three  classes  into  one.  This 

(1  +  E  Mr)2 

requires  0  ((M  —  l)(l  +  Y]ATr)  + - — - )  =  1926  operations  for  each  iteration 

~r  2 

r 

during  the  solution,  and  0  (l+^Nr)  =  46  locations.  This  procedure  required  11 

r 

iterations  to  converge  to  predicted  multiprogramming  levels  within  .05  (.33%)  of 
the  true  multiprogramming  levels.  The  total  operations  count  is  therefore  on 
the  order  of  21,186,  or  about  1%  of  that  for  the  exact  solution.  The  results  of 
this  solution  are  given  in  table  7.3. 


The  cheap  aggregate  solution  indicated  that  classes  B  and  C  are  most  alike, 
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Table  7.3  -  Aggregation  (ABC) 

Cost-  Time:  21,186  Space:  46 

A  B  C  Total 
System  Throughputs:  .202  .120  .107  .428 

Bottleneck  Queue  Length:  2.965  1.764  9.498  14.223 


since  each  has  significant  queue  lengths  at  centers  nine  and  fourteen,  while 
class  A  does  not  even  visit  center  nine.  Based  on  this  information,  the  approxi¬ 
mate  solution  obtained  by  aggregating  classes  B  and  C  was  computed  (table  7.4). 


Table  7.4-  Aggregate  (A)(BC) 


Cost  -  Time:  342,240  Space:  496 


B 

.123 


C 

.107 


Total 

.437 


System  Throughput:  .207 

Bottleneck  Queue  Length:  3.291  1.863  9.747  14.901 

This  approximation  requires  0  ((NA  +  l)(NB  +Airc-  + 1))  =  496  locations,  and 

(NA  +  l)2(NB+Nc  +  iy‘ 


0((M-1)(Na  +  1)(Nb+Nc+1)+ 


-)  =  68,448  operations  per  step 


of  the  iteration.  Convergence  to  multiprogramming  levels  within  .05  of  the  exact 
values  was  achieved  in  five  iterations,  for  a  total  operation  count  of  342,240. 
This  is  a  little  more  than  sixteen  times  as  expensive  as  the  single  class  approxi¬ 
mation,  but  is  only  15%  of  the  computation  required  for  the  exact  solution. 


Assuming  we  do  not  have  the  exact  solution,  we  note  that  the  variation 
between  the  two  approximate  solutions  is  small.  Throughputs  vary  by  less  than 
3%,  and  queue  lengths  bv  less  than  11%.  Since  the  variation  is  small,  it  is  likely 
that  the  two  approximate  solutions  are  near  the  true  solution.  Of  the  two,  the 
(A)(BC)  aggregation  is  more  likely  the  better  approximation,  since  it  retains 
more  of  the  characteristics  of  the  original  network  than  the  single  class  approxi¬ 
mation. 


If  the  two  aggregate  solutions  had  differed  considerably  in  their  predictions. 


we  could  not  have  concluded  that  either  was  near  the  true  solution.  In  that 
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case,  distinct  aggregations,  each  combining  all  but  one  class,  might  be  per¬ 
formed.  An  estimate  fur  the  exact  solution  could  then  be  found  by  extracting 
measures  of  the  unaggregated  class  in  each  solution.  However,  in  these  cases 
there  is  no  assurance  of  compatibility  of  the  measures  for  each  class.  A  more 
expensive  procedure  is  to  continue  increasing  the  number  of  aggregate  classes 
used  until  the  variation  In  the  solutions  is  sufficiently  small.  This  procedure 
might  be  necessary  if  there  were  multiple  groups  of  classes,  each  of  which  was 
strongly  bottlenecked  at  a  distinct  device.  However,  each  additional  aggregate 
class  greatly  increases  the  complexity  of  the  network,  and  so  it  may  not  be 
feasible  to  extend  the  process  sufficiently  because  of  the  storage  and  time 
requirements  involved. 

Table  7.5  lists  the  percentage  error  for  each  of  the  four  possible  aggregate 
solutions  for  the  network  of  table  7.1.  The  estimates  chosen  by  the  procedure 
described  above  are  the  most  accurate  overall.  The  most  accurate  solution  for 
each  class  (in  terms  of  throughput)  is  the  aggregation  that  leaves  that  class 
unaggregated  while  aggregating  the  others.  In  all  cases,  throughputs  are  within 
6%  and  queue  lengths  within  13%  of  the  exact  solution. 


Table  7.5  -  All  Aggregates 


Cost/ 

# 

Throughput 

Queue  Length 

Aggreg .  Iter. 

Iter. 

ABC 

Total 

A 

D 

C  Total 

(ABC)  1926 

11 

-2.8  -5.1  -3.0 

-3.5 

-6.5 

-12.7 

-.7  -2.3 

(BC)(A)  60,480 

5 

0.0  -2.6  -3.2 

-1.5 

3.7 

-7.8 

4. 1  2.4 

(AC)(B)  60,480 

10 

-4.2  -i.2  -.5 

-2.5 

-11.5 

-4. 1 

-.8  -3.7 

(AB)(C)  60,480 

0 

.0  -5.3  -.2 

-1.2 

-1.2 

11.2 

-4.0  -4.4 

The  justification  given  earlier  for  choosing  class  aggregation  over  customer 
aggregation  for  the  network  of  table  7.1  is  that  customer  aggregation  is  not  as 
well  suited  as  class  aggregation  for  networks  with  many  classes.  The  heuristic 
adjustments  done  to  the  performance  measures  by  the  customer  aggregation 
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procedure  work  well  when  each  class  of  t.he  network  either  has  balanced  usage 
of  the  devices,  or  saturates  a  device.  The  heuristics  are  poor  when  the  interfer¬ 
ence  of  the  classes  at  a  center  causes  a  bottleneck  that  is  not  apparent  from  the 
behavior  of  any  one  class.  Unfortunately,  this  situation  exists  in  the  model 
under  consideration.  We  therefore  expect  the  results  obtained  using  customer 
aggregation  to  be  inferior  to  those  obtained  with  class  aggregation. 

Table  7.6  lists  the  relative  error  of  the  approximate  solution  obtained  using 
customer  aggregation  with  a  number  of  aggregation  sizes.  As  expected,  the 
solutions  are  generally  less  accurate  than  those  obtained  using  class  aggrega¬ 
tion.  We  note  that,  while  the  accuracy  of  the  performance  measures  by  class  is 
quite  variable,  the  total  performance  measures  (summed  over  the  classes)  are 
reasonably  accurate. 

Table  7.6  -  Customer  Aggregation  (Relative  Error) 

Aggreg.  System  Throughput  Bottleneck  Qlen 


Pa 

PB  PC 

Time 

Space 

A 

B 

C 

Total 

A 

B 

C 

Total 

3 

3 

3 

14904 

216 

-.8 

35.2 

70.5 

27.1 

24.4 

8.5  -6.7 

2.2 

5 

5 

5 

3200 

64 

-20.5 

10.3 

43.3 

5.0 

12.7 

.4  -13.3 

-3.8 

5 

5 

4 

1 

30944 

r\r  f 
CdU 

-11.2 

19.0 

1 16.5 

30.1 

100.4 

67.6 

-.6 

30.9 

1 

1 

15 

54272 

512 

84.3 

140.4 

20.9 

04.5 

-68.3 

-63.4  -26.7 

-42. 1 

Table  7.6  points  out  that  it  is  best  to  aggregate  all  classes  nearly  equally  (so 
that  the  relationship  between  customers  of  different  classes  stays  nearly  con¬ 
stant)  than  to  select  one  class  to  leave  unaggregated  and  aggregate  the  rest. 
This  behavior  is  just  the  opposite  of  that  for  class  aggregation.  The  results  also 
show  that  the  adjustment  heuristics  used  act  somewhat  erratically  when  the 
competition  for  resources  by  the  customers  in  the  other  classes  dominates  the 
behavior  of  each  individual  class.  This  occurs  probably  because  the  heuristics 


consider  individual  classes  one  at  a  time,  and  do  not  reflect  the  effect  of  the 


-  141  - 


other  classes. 


Another  possible  approximate  solution  technique  is  to  combine  the  best 
aspects  of  the  class  and  customer  aggregation  techniques.  The  class  aggrega¬ 
tion  technique  is  good  for  networks  with  many  classes,  but  requires  an  iterative 
solution.  At  each  step  of  the  iteration,  the  solution  of  a  network  with  fewer, 
larger  classes  must  be  obtained.  11  in  the  class  aggregation  technique  all  classes 
are  aggregated  into  one,  the  customer  aggregation  technique  can  be  used  to 
obtain  cheap,  accurate  solutions  for  each  iteration  step.  Since  customer  aggre¬ 
gation  performs  well  for  single  class  networks,  the  combination  of  two  methods 
is  an  approximation  technique  with  inherent  error  behavior  like  trie  class  aggre¬ 
gation  technique,  but  is  considerably  cheaper. 


Tabic  7.7  -  Composite  Aggregation 


Cost/ 

1’, 

tr 

Throughput 

Bottleneck  Qlen 

7A 

F 

Iter. 

Iter. 

A 

B 

C  T 

e’eal 

A_ 

B 

C 

Total 

i 

X 

i  one 

X  «  V  kO  LJ 

i  i 

X  1- 

p. 

-5. 1 

-3.0  - 

3.5 

-6.5 

12.7 

_  V 

■  t 

-2.3 

3 

352 

9 

6.8 

-5.5  - 

10.9  - 

1.2 

A 

.-r 

-15.1 

-9.3 

-3.0 

5 

190 

5 

7.6 

-5.8  - 

10.6 

-.7 

-6.8 

-21.9  - 

16.0 

-14.8 

9 

102 

5 

8.3 

-0.7  - 

10.1 

-.5 

-17.8 

-32.2  - 

26.1 

-25.1 

15 

84 

5 

9.0 

-7.7 

-9.7 

-.4 

-26.3 

-40.3  - 

33.8 

-33. 1 

7/ 

7  lists 

the 

peree 

ntage 

error 

in 

the  solutions 

obtained 

combined  method  for  various  choices  of  the  customer  aggregation  size.  The 
accuracy  of  the  method  is  bounded  by  the  class  aggregation  scheme  with  an 
exact  solution  at  each  iteration,  and  decreases  as  the  aggregation  size  is 
increased.  As  is  expected,  throughputs  are  more  accurate  than  queue  lengths. 


7.2  SUMMARY 


ft  is  very  difficult  to  give 


a  comprehensive  qualitative  comparison  of  the 


class  and  customer  aggregation  techniques.  The  observed  behavior  of  the  two 
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approximations  indicate  that  class  aggregation  is  the  better  method  for  multi- 
class  networks,  unless  there  is  little  interaction  between  the  classes.  Class 
aggregation  also  results  in  the  greatest  cost  reduction  for  these  networks,  and 
so  is  most  beneficial  in  terms  of  economy  of  solution. 

On  the  other  hand,  class  aggregation  is  not  applicable  to  single  class  net¬ 
works,  nor  to  networks  with  more  genera!  types  of  ioad  dependence  than  func¬ 
tions  of  the  total  queue  length  at;  the  center.  Customer  aggregation  is  useful  in 
these  situations.  Even  single  class  networks  can  be  very  expensive  to  solve 
exactly  if  they  contain  load  dependent  service  centers,  since  the  number  of 
operations  involved  is  proportional  to  the  square  of  the  number  of  customers. 
By  reducing  the  multiprogramming  levels,  customer  aggregation  greatly 
reduces  the  expense  of  the  solution. 


Chapter  8  -  MEAN  VALUE  ANALYSIS  BASED 
APPROXIMATIONS 


In  previous  chapters  we  have  been  concerned  with  obtaining  cheap,  approx¬ 
imate  solutions  to  separable  queueing  networks  using  aggregation.  The  tech¬ 
niques  developed  were  very  closely  tied  to  the  equilibrium  state  probabilities  of 
these  networks.  Tt.  was  therefore  natural  to  use  modifications  of  convolution 
solution  methods  in  our  approximations,  as  these  methods  deal  with  state  pro¬ 
babilities. 

Bard  [79]  suggests  a  technique  for  approximating  the  solution  of  large,  load 
independent,  separable  queueing  networks  that  is  based  solely  on  mean  perfor¬ 
mance  measures,  rather  than  state  probabilities.  His  technique  is  a 
modification  of  the  mean  value  analysis  [Reiser  and  Lavenberg  78,80]  exact  solu¬ 
tion  technique.  In  this  chapter  we  present  this  method,  and  develop  a 
modification  with  slightly  increased  accuracy,  as  well  as  compare  it  with  the 
methods  developed  in  the  previous  chapters.  We  begin  with  a  brief  overview  of 
the  mean  value  analysis  solution  technique. 

8.1  MEAN  VALUE  ANALYSIS 

Mean  value  analysis  (MVA)  is  a  solution  technique  for  product  form  networks 
first  proposed  by  Reiser  and  Lavenberg  [78,80].  The  technique  is  based  on 
Little’s  theorem  [Little  61]  applied  to  individual  service  centers  and  the  network 
as  a  whole.  The  key  to  the  computational  procedure  is  the  transit  time  equ  tion 
[Reiser  and  Lavenberg  78] 


7i  “  1  ®7n  \  ft* 


(8.1a) 


Here  WKm ($)  is  the  mean  transit  time  of  class  r  at  center  m  in  the  network  with 
population  Sr  m  is  the  mean  service  time  of  class  r  at.  center  m,  am(n)  is  the 
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ratio  of  the  service  rate  with  n  customers  to  the  rate  with  1  customer,  and 
—$r)  is  the  equilibrium  probability  that  n- 1  customers  of  any  class 
are  at  center  m  with  customer  population  —  in  the  network. 

For  load  independent  service  centers,  equation  (8.1a)  reduces  to 

VnmCtf )  =  Sr.m  (1  +  Sm(tf-*r))  (8.1b) 

where  Qm(J^—ir)  is  the  mean  queue  length  of  center  m  with  customer  population 

in  the  network. 

Given  the  the  throughput  of  each  class  r,  Tr($),  can  be  computed 

by 

Tr(ft)  =  ~y - — -  (8.2) 

£  K.n,  K.mlti) 
m= 1 

where  VrM  is  the  average  number  of  visits  to  service  center  m  by  class  r  jobs, 
and  Nr  is  the  number  of  class  r  customers  in  the  network. 

Finally,  the  mean  queue  length,  or  queue  length  distribution, 

Pm  (n  \&).  can  be  computed  as  required  by 

Qm$)  =  £  Tr(i))  Vnm  Wnm0)  (8.3a) 

r  = 1 

—  ,  v  Yj  Sr,m  Fr(-^)  Vr.m  Pm(n~  1  7l>0 

<hnW  r=1 

m  „  (8-3b) 

n=0 

3  =  1 

Starting  with  the  trivial  solution  for  the  network  with  no  customers,  equa¬ 
tions  (8.1),  (8.2),  and  (8.3)  can  be  used  iteratively  to  derive  the  solution  for  suc¬ 
cessively  larger  populations  up  to  the  full  population  Af.  This  solution  yields  the 
mean  throughputs,  queue  lengths,  and  transit  times  by  class  and  center  for  both 
load  independent  and  load  dependent  centers. 

0.2  APPROXIMATE  SOLUTIONS  USING  MEAN  VALUE  ANALYSIS 


pm(n \rf)  = 
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Bard  [79a]  proposes  that  in  a  load  independent  network  with  a  large 
number  of  customers,  a  good  approximation  for  Qm($ -£r),  the  mean  queue 
length  at  center  m  for  customer  population  $ -3 r,  is  Under  this  assump¬ 

tion,  he  substitutes  Qm($)  for  Qm( $-&r)  into  the  transit  time  formula  for  load 
independent  centers  (equation  (8.1b))  for  customer  population  /?,  and  iterates 
to  obtain  a  solution.  The  iteration  begins  with  some  guess  for  the  Qvl($).  From 
this  guess,  a  set  of  V/rm(N)  are  computed  by  (8.1b),  and  then  a  corresponding 
set  of  Tr($)  and  Qm(N)-  These  new  queue  lengths  are  used  to  compute  updated 
Wrtm($)<  and  the  cycle  repeats.  The  iteration  halts  when  some  convergence  cri¬ 
terion  is  met  (e.g.,  when  successive  guesses  for  Qm(rf)  differ  by  less  than  some 
small  e).  Bard  [79a]  shows  that  this  solution  is  equivalent  to  assuming  that  the 
bottleneck  of  the  system  is  fully  saturated  with  all  customers  residing  in  the 
bottleneck’s  queue. 

A  modification  of  this  basic  scheme  was  proposed  by  Schweitzer  [Bard  79b]. 
A  better  approximation  for  -<?r)  is  given  by 

~^Qr.m0)  +  T,  Qim(&)  (3.4) 

This  approximation  can  be  substituted  into  equation  (8.1b),  and  an  iteration  per¬ 
formed  to  obtain  a  solution  for  the  full  customer  population  N  in  a  manner  simi¬ 
lar  to  that  described  above. 

Table  8.1  lists  exact  solutions  for  two  queueing  networks,  nod  the  approxi¬ 
mate  solutions  obtained  using  equation  (8.4).  As  can  be  seen,  the  approximation 
is  very  good  for  both  large  and  small  customer  populations.  Empirical  results 
also  indicate  that  convergence  is  quite  fast  (less  than  25  iterations  typically  for 
queue  length  convergence  to  .01).  This  approximation  can  therefore  be  a  rea¬ 
sonable  alternative  to  exact  solution  techniques  because  of  its  speed  and  accu¬ 
racy.  An  additional  advantage  is  that  the  storage  requirement  of  the  approxima¬ 
tion  is  only  0(RM),  which  is  very  significantly  less  than  that  required  for  an 
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Table  8.1  -  Approximation  (3.4) 

Network  1:  20  Class  A  customers  with  relative  utilization 

vector  (l  3  5) 

15  Class  B  customers  with  relative  utilization 
vector  (10  1  1) 


Utilization 

Que 

ue  Length 

Cl 

C2 

C3 

Cl 

C2 

C3 

Approximate  -  Cost  = 

35/iteration  x 

15  iterations 

Class  A 

.131 

.  542 

.904 

3.13 

1.26 

15.51 

Class  B 

.804 

.080  . 

,080 

13.36 

.21 

1.43 

Exact  -  Cost  =  2106 

Class  A 

.184 

.551 

918 

3.24 

1.49 

15.26 

Class  B 

.316 

.082  . 

032 

13.31 

.22 

1.47 

Network  2:  2  Class  A  customers  with  relative  utilization 

vector  (13  5) 

3  Class  B  customers  with  relative  utilization 
vector  (2  1  7) 


Utilization  Queue  Length 


Cl  C2  C3  Cl  C2 
~  °-5/iteration  x  6  iterations 


C3 


Approximate  -  Cost 
Class  A 
Class  B 
Exact  -  Cost  -  72 
Class  A 
Class  B 


.073 

.219 

.365 

.173 

.087 

.607 

.074 

.222 

.370 

.179 

.089 

.625 

.09 

.28 

1.63 

.22 

.12 

2.66 

.10 

.29 

1.61 

.23 

.13 

2.63 

exact  solution.  Further,  the  amount  of  storage  required  by  the  approximation  is 
independent  of  the  customer  population,  making  it  suitable  for  networks  with 
large  numbers  of  customers. 

We  note  here  that  the  iterative  solution  is  exact  for  any  network  with  a  cus¬ 
tomer  population  consisting  of  only  one  customer.  Equation  (8.4)  predicts  that 
the  mean  queue  length  at  center  in  when  there  are  no  customers  in  the  network 
is  0,  which  is  trivally  correct.  The  solution  obtained  for  1  customer  is  therefore 
exact.  This  is  an  indication  that  the  approximation  might  be  good  for  small 
populations. 

The  approximation  is  also  asymptotically  correct  as  the  number  of  custo¬ 
mers  in  any  one  class,  say  r,  grows.  For  sufficiently  large  Nr,  one  of  the  service 
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centers  will  become  the  bottleneck  of  the  system  (the  argument  also  holds  for 
multiple  bottlenecks).  As  each  additional  class  r  customer  is  added  to  the  net¬ 
work,  it  will  lend  to  spend  all  of  its  time  al  the  bottleneck  device.  The  queue 
length  at  the  bottleneck  will  therefore  increase  by  about  one  customer  for  each 
customer  added.  This  is  exactly  the  prediction  made  by  equation  (8.4),  since 

Qr.mitf) 


Nr 


1  as  Ah  ->  «>. 


However,  there  is  an  inaccuracy  in  the  approximation  that  could  be 
significant  for  networks  with  only  a  small  number  of  customers  in  each  of 
several  classes.  Equation  (8.4;  says  that  the  mean  queue  lengths  of  all  classes 
cxecpt  class  r  are  unaffected  by  the  introduction  of  a  class  r  customer  (this 
same  assumption  is  made  by  Reiser  [79a, 79b]  and  Reiser  and  Lavenberg  [78]  for 
a  multiple  class  approximation).  This  can  be  a  poor  assumption.  For  example, 
consider  a  network  with  two  service  centers,  and  two  customer  classes  with  one 
customer  each.  Let  class  A  have  relative  utilization  vector  (l  5),  that  is,  it 
receives  most  of  its  service  at  center  2.  Let  class  B  receive  service  only  at 
center  1,  and  therefore  has  a  relative  utilization  vector  (l  0).  Equation  (8.4) 
predicts  that  the  mean  queue  length  of  class  A  customers  at  center  1  is  identical 
in  the  network  with  populations  of  only  a  single  class  A  customer  and  with  one 
customer  in  each  of  the  two  classes.  Clearly,  this  is  a  very  bad  assumption. 

Table  'G.2  -  Example 

Na  =  1  Na  =  1. 

Xa  =  (i  r.)  xB  =  (i  o) 

Utilization  Queue  Length 


Cl 

no 

K.'  t*u 

Cl 

r  o 

Approximate: 

rn  ~  ~  a 

n. 

a  a  on 

.7143 

.238 

.714 

Class  B 

.7777 

.0000 

1.0 

.000 

Exact: 

Class  A 

.  1429 

.7143 

.288 

.714- 

Class  B 

.8571 

.0000 

1.0 

nnn 

.  U  WVJ 
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Table  8.2  gives  the  exact  and  approximate  solutions  for  the  above  network. 
We  note  that  there  is  considerable  error  in  the  approximation  for  class  B  utiliza¬ 
tion  (and  hence  throughput).  In  general,  we  can  expect  this  error  to  occur 
■whenever  the  introduction  of  a  customer  in  one  class  significantly  afreets  the 
queue  lengths  of  the  other  classes.  We  expect  this  effect  to  be  most  significant 
for  the  first  few  customers  in  each  class,  indicating  that  networks  with  some 
classes  having  only  a  few  customers  might  be  subject  to  this  type  of  error. 

As  previously  stated,  the  approximation  is  poor  in  these  cases  because  it 
assumes  that  only  a  single  class  is  affected  by  the  introduction  of  a  new  custo¬ 
mer  to  the  network.  For  this  reason  we  propose  the  following  approximation  as 
the  basis  of  an  iterative  solution: 


Wr. -  -S!i—  (S.5) 

A  j 

The  iteration  is  initialized  by  some  guess  for  the  Wrm( ]$)  and  Then, 

using  equations  (6.2),  and  (3.3b),  we  compute  throughputs  and  mean  queue 
lengths  for  customer  populations  j  — 1, Finally,  equation  (9.1b)  is 

used  to  compute  a  new  set  of  (A^)  from  the  performance  measures  obtained 
from  the  R  solutions.  These  are  then  used  to  begin  the  next  iteration  step.  The 
iteration  continues  until  some  convergence  criterion  is  met.  This  iteration 
requires  time  and  space  0{R2M)  in  contrast  with  0(RM)  for  equation  (8.4). 

Table  8.3  lists  the  results  of  the  approximations  given  by  equations  (8.4) 
and  (8.5).  In  our  experiments,  equation  (8.5)  has  been  the  most  accurate  and 


the  most  robust.  We  note  that  equation  (8.5)  is  exact  for  networks  with  only  a 
single  customer.  However,  this  exact  solution  results  because  the  throughput 
formula  8.2  is  exact  for  networks  with  no  customers  regardless  of  the  predicted 
transit  times.  Any  approximation  for  WTTn(fJ  —&r)  would  therefore  be  exact  for 
networks  with  only  one  customer. 
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Tabic  G.3  -  Approximation  (9.5) 


Na  -  1  i\  jj  -  1 

XA  -  (1  1.5)  Xn  =  (1  0) 


Utilization 

Queue  Length 

Cl 

C2 

Cl 

C2 

Approximate  -  Eq. 

8.4 

Class  A 

.206  . 

427 

.571 

.429 

Class  B 

.633 

1.0 

Approximate  -  Eq. 

8.5 

Class  A 

.286  . 

429 

.571 

.4-29 

Class  B 

.714 

1.0 

Exact  - 

Class  A 

.286  . 

429 

.571 

i  no 

Class  B 

.714 

1.0 

N A  =  2 

Nn  = 

2 

Nr 

=  2 

XA  =  (10  1  1) 

1! 

£ 

(i  10  1) 

Xc 

=  (1  1 

10) 

Utilization 

Queue  Length 

Cl 

C2 

C3 

Cl 

no 

4w/  tu 

C3 

Approximate  -  Eq.  8.4 

Class  A 

.715 

.072 

.072 

1.59 

.21 

.21 

Class  B 

.072 

.715 

.072 

.21 

1.58 

.21 

Class  C 

.072 

.072 

.715 

.21 

.21 

1.58 

Approximate  -  Eq.  8.5 

Class  A 

.755 

.076 

.076 

1.55 

.22 

.22 

Class  B 

.076 

.756 

.076 

.23 

1.55 

.23 

Class  C 

.076 

.076 

.756 

.22 

.22 

1.55 

Exact  - 

Class  A 

.771 

.077 

.077 

1.54 

.23 

.23 

Class  B 

.077 

.771 

.077 

.23 

1.54 

.23 

Class  C 

.077 

.077 

.771 

.23 

.23 

1.54 

We  have  tried  other  approximations  based  on  the  transit  time  formula.  In 
particular,  we  have  substituted  the  following  for  equation  (8.5): 


wr.n0)- 

wrm(U-sf) «  wr,m{$)- 


‘ J  r.  m 


S’ 

ur.vz 

°r,7n 

“jv~ 


(Qm0)-Qm(#-3j)) 
Q^m(A f-ar) 


(8.6a) 
(8.6b) 
(8. 6c) 


The  motivation  for  equation  (8.6a)  is  that  intuitively  it  appears  to  be  more 


correct  than  equation  (8.5).  Equation  (8.6b)  uses  information  obtained  during 
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the  iteration  process  to  estimate  the  change  in  class  r’s  queue  lengths  when  a 
class  j  customer  is  removed  from  the  system.  Equation  (8.8c)  is  exact  for  net¬ 
works  with  one  customer  in  each  of  two  classes.  None  of  these  modifications 
results  in  a  general  improvement  over  equation  (8.5),  and  so  were  not  pursued 
further. 


8.3  SUMMARY 

We  have  applied  the  iterations  described  by  equation  (8.4)  and  (8.5)  to  a 
number  of  the  standard  examples  from  chapters  4,  5,  and  6.  Because  the  MVA- 

based  iterations  are  not  applicable  to  networks  with  any  load  dependent  service 
centers,  all  examples  are  load  independent. 


Table  8.4  -  Standard  Example  (Relative  Error) 

Na  =32 
Np  =  32 


EXAMPLE  A  - 


XA  =  (1  .5  .5  .5  .5  .5  .5  .5  .5  .8  .8  .8  .8  .8  .8  .8) 
Xa  =  (.5  .5  .5  .5  .5  .5  .5  .5  .8  .3  .8  .8  .3  .8  .8  1) 


Cost 

Class  A 

Cla 

ss  B 

TIME 

SPACE 

TPUT 

QLEN 

TPUT 

QLEN 

Exact 

34,848 

2,178 

0.0 

0.0 

0.0 

0.0 

Class  Agg. 

4,100  (1040x4) 

130 

-.2 

-1.3 

.1 

2.0 

Cust.  Agg. 

9,248 

578 

-1.3 

.8 

-1.3 

.0 

Eq.  8.4 

3,832  (19.2x46) 

50 

-.2 

12.5 

-1.0 

14.3 

Eq.  8.5 

23,040  (840x36) 

150 

-.1 

1 1.2 

-.9 

12.8 

IXAMPLE  B  - 

XA 

=  (1  .25  .25  .25  .25  . 

25  .25  . 

25 

.25  .5  . 

5  .5  .5  . 

5  .5  .5) 

/  r~  r~  r*  r~  cz 
JJ  —  ^ .  O  ■  ^  •  O  -*_)  •  <J) 

.5  .5  .5 

.8 

.8  .8  .3 

.8  .8  .8 

1) 

Cost 

Class  A 

Cla 

ss  B 

TIME 

SPACE 

TPUT 

QLEN 

TPUT 

QLEN 

Exact 

34,843 

2, 178 

0.0 

0.0 

0.0 

0.0 

Class  Agg. 

5,200 (1040x5) 

130 

_p  n 

U  P 

u  ■  n* 

-.6 

-4.1 

Cust.  Agg. 

9,248 

578 

-11.1 

-13.5 

-.9 

-4.7 

Eq.  8.4 

9,408  (192x4-9) 

50 

-4.1 

9.2 

.2 

5.1 

Eq.  8.5 

16,000  (640x25) 

150 

-3.7 

8.0 

.2 

4.4 
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EXAMPLE  C  - 


XA 

=  (1  .25  .25  .25  .25  . 

25  .25  .25 

.25  .5 

.5  .5  .5  .5  . 

,5  .5) 

XB  =  (.8  .8  .8  .8 

.3  .8  .3  .8 

1111 

1  1  1  1) 

Cost 

Cl  a: 

jAl 

Class  B 

TIME 

SPACE 

TPUT 

QLEN  TPUT 

QLEN 

Exact 

34,848 

2,178 

0.0 

0.0 

0.0 

0.0 

Class  Agg. 

5,200  (1040x5) 

130 

0.0 

-.3 

-.2 

-4.3 

Gust.  Agg. 

9,248 

578 

-6.5 

-9.2 

-.4 

-3.6 

Eq.  8.4 

5,952  (192x31) 

50 

-2.2 

4.4 

.3 

3.5 

Eq.  8.5 

14,720  (640x23) 

1  r.n 

-1.3 

3.7 

o 

3.1 

Table  8.4  shows  the  results  of 

three  of 

these 

examples 

.  We 

see  that 

MVA-based  approximations  are  very  accurate  for  throughput  estimates,  but  can 
be  much  less  accurate  for  queue  lengths. 

In  summary,  the  MVA-based  approximations  generally  give  very  good  results 
for  throughputs,  and  somewhat  less  accurate  results  for  queue  lengths.  Because 
they  require  very  little  storage,  and  only  a  moderate  amount  of  computation, 
they  can  be  quite  cost  effective  for  load  independent  networks.  Their  principal 
drawback  seems  to  be  that  they  are  not  applicable  to  networks  with  any  load 
dependent  service  centers,  which  means  that  they  are  not  suitable  for  use  in  a 
hierarchical  modeling  study. 
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CHAPTER  9  -  CONCLUSIONS  AND  DIRECTIONS  FOE  FUTURE  WORK 


9.1  CONCLUSIONS 

Our  aim  in  this  thesis  has  been  to  examine  the  problem  of  solving  very  large 
separable  queueing  network  models.  By  a  detailed  analysis  of  the  solution  tech¬ 
niques  for  these  networks,  we  have  shown  that  the  resources  required  to  obtain 
exact  solutions  can  quickly  become  prohibitive.  Our  work  therefore  focused  on 
approximation  techniques.  In  particular,  we  have  examined  the  feasibility  of 
obtaining  good  approximate  solutions  by  aggregating  various  network  com¬ 
ponents,  thereby  reducing  the  network  complexity. 

The  work  presented  here  has  both  practical  and  academic  interest.  On  the 
practical  side,  the  techniques  developed  can  be  used  in  situations  where  it  is 
either  impossible  or  prohibitively  expensive  to  solve  the  model  exactly.  It  is 
clear  that  such  large  models  will  be  used  with  increasing  Frequency  as  the  trend 
toward  decentralized  machines  and  cheaper  hardware  continues.  We  have  both 
developed  techniques  to  solve  these  models  approximately,  and  examined  the 
error  behavior  of  these  techniques  using  representative  sets  of  queueing  net¬ 
work  models. 

On  the  academic  side,  we  have  developed  a  set  of  approximate  solution 
techniques  based  on  a  formal  study  of  aggregation  of  network  resources,  and 
have  examined  the  conditions  under  which  the  approximations  can  be  expected 
to  give  good  results.  We  have  attempted  to  derive  analytic  bounds  for  the  error 
of  these  techniques,  with  partial  success.  The  work  presented  forms  a  basis  for 
possible  further  work  aimed  at  improving  the  techniques  or  developing  tighter 
error  bounds  for  them. 

It  is  of  both  practical  and  academic  interest  to  summarize  and  compare 
briefly  the  techniques  discussed:  class  aggregation,  customer  aggregation, 


IRQ  . 

iuu 


center  aggregation,  and  two  variants  of  an  iteration  approximation  based  on 
mean  value  analysis  (those  represented  by  equations  (8.4)  and  (8.5)).  We  begin 
by  reviewing  the  strengths  and  weaknesses  of  each  technique. 

Class  aggregation  is  probably  the  most  natural  approximation  technique  of 
those  discussed.  The  first  analytic  modeling  studies  employed  a  manual  form  of 
class  aggregation  by  combining  the  measurement  data  obtained  for  all  jobs 
completed  during  a  measurement  interval,  and  assuming  that  ail  jobs  were  alike 
(i.e.,  were  in  the  same  class).  The  success  of  these  models  shows  that  class 
aggregation  can  be  quite  accurate.  As  one  would  expect,  it  is  most  accurate 
when  the  classes  combined  are  quite  alike.  In  practice,  this  means  that  custo¬ 
mers  in  the  aggregated  classes  should  spend  most  of  their  time  at  the  same  dev¬ 
ices,  i.e.,  the  classes  should  share  a  bottleneck.  Similarly,  the  technique  per¬ 
forms  worst  when  the  aggregated  classes  have  distinct  bottlenecks. 

Customer  aggregation  is  a  less  intuitive  technique.  For  single  class  net¬ 
works,  it  is  possible  to  trade  accuracy  for  computational  effort  on  an  almost 
continuous  spectrum.  (This  is  done  by  varying  the  accuracy  of  the  transforma¬ 
tion  from  the  aggregate  model  back  to  the  actual  model.)  For  both  single  and 
multiple  class  models,  efficient  heuristics  provide  a  means  of  transforming 
aggregate  performance  measures  into  approximate  performance  measures  of 
the  original  model.  These  heuristics  are  exact  for  two  extreme  model  types,  bal¬ 
anced  and  saturated  networks.  One  can  therefore  expect  the  approximation  to 
work  well  at  these  extremes.  However,  because  the  technique  relies  on  heuris¬ 
tics  that  do  not  fully  capture  the  behavior  of  multiclass  networks,  it  can  be 
somewhat  unreliable  for  them.  Finally,  it  is  the  only  one  of  the  techniques  that 
can  be  used  for  single  class  models  with,  load  dependent  service  centers.  Since 
it  is  both  expensive  to  solve  these  models  exactly  (the  cost  is  0(MN2))  and  the 
customer  aggregation  technique  works  well  for  them,  it  can  be  a  very  useful 
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technique. 

Center  aggregation  is  the  least  efficient  of  the  approximation  techniques,  it 
is  most  useful  when  the  model  represents  a  number  of  devices  with  low  utiliza¬ 
tion,  and  others  with  moderate  or  high  utilization.  Since  the  performance  meas¬ 
ures  of  the  netwrorlc  are  determined  primarily  by  the  more  highly  utilized 
centers,  one  can  reduce  the  eflorl  required  to  solve  the  model  without 
significantly  affecting  the  results  by  aggregating  the  remaining  centers. 

All  three  of  the  above  techniques  are  distinguished  from  the  MV A  techniques 
in  that  they  are  based  on  detailed  information  about  the  state  space  of  the  net¬ 
work.  Among  the  implications  of  this  is  that  they  can  all  be  used  for  networks 
containing  load  dependent  service  centers;  the  MVA-based  techniques  cannot. 
This  detail  has  other  advantages.  One  of  these  is  that  it  is  feasible  to  describe 
formally  the  assumptions  necessary  for  them  to  give  exact  results.  These 
assumptions  are  both  indications  of  when  the  techniques  should  prove  effective, 
and  a  guide  to  possible  modifications  to  them  that  could  improve  their  accuracy. 

The  MVA-based  techniques  involve  a  heuristic  based  on  the  arrival  instant 
theorem  for  separable  networks  ([Seveik  and  Mitrani  79],  [Lavenberg  and  Reiser 
80]).  The  heuristic  is  asymptotically  correct  for  large  customer  populations, 
and  also  performs  very  well  for  small  populations.  (The  prima^  distinction 
between  the  twro  variations  is  that  one  corrects  an  assumption  of  the  other  that 
can  cause  it  to  perform  poorly  for  networks  with  classes  with  only  a  few  custo¬ 
mers  in  each  class.)  The  techniques  give  very  good  predictions  for  throughputs 
and  utilizations,  but  are  less  accurate  for  queue  lengths.  They  do  not  produce 
any  information  about  queue  length  distributions. 

Tabic  9.1  summarizes  the  time  and  space  requirements  for  each  technique. 
As  shown  by  the  table,  a  major  advantage  of  the  MVA-based  algorithms  is  their 
small  storage  requirement,  which  is  independent  of  the  customer  population. 
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Table  9.1  -  Approximation  Technique  Complexity 

EXACT 

CLASS 
AGG 

CUSTOMER 

AGG 

CENTER 

AGG 
MVA 

(8.4) 

MVA 

(8.5) 

R'  =  number  of  aggregate  classes 
AT  =  number  of  centers  aggregated 

They  are  also  very  efficient  in  terms  of  time  required  for  each  iteration.  Of  the 
aggregation  techniques,  class  aggregation  can  result  in  the  most  dramatic 
reduction  in  time  and  space  requirements. 

In  choosing  one  of  these  techniques,  the  analyst  must  be  aware  of  both  its 
computational  cost,  and  the  types  of  models  to  which  it  can  be  successfully 
applied.  In  some  situations  the  choice  is  clear:  a  severe  storage  constraint  may 
necessitate  an  MVA  technique;  a  load  dependent  network  necessitates  an  aggre¬ 
gation  technique.  In  others  situations,  the  choice  will  have  to  be  made  accord¬ 
ing  to  the  analyst's  judgement  and  experience  gamed  from  the  information  in 
this  thesis  and  actual  implementation  and  use  of  the  techniques  in  modeling  stu¬ 
dies. 

9.2  A  CONCLUDING  EXAMPLE 

The  motivation  for  the  work  presented  in  this  thesis  has  been  the  practical 


TIME  SPACE 
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RMYl(Nr+l) 

r=  1 

n<^r+l) 

R' 

R'M  j  J  (Mr  + 1) /iteration 

r  =  1 

fl(Mr+l) 

r  =  l 

R  KT 
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r  \r 
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R  AL  R 
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r=l  TV  r=l 
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r~  1 

6RAf/iteration 

RM  +  R  +  M 

6  (R  +  l)R.M/iteration 

(R  + 1 ) [RM  +R  +M  ] 

— .  -  — . 
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problem  of  obtaining  reasonably  accurate  "olutions  for  very  large  queueing 
models.  In  chapter  1  we  presented  the  Toronto  Stock  Exchange  (TSE)  computer 
system  (figure  1.1)  as  an  example  of  a  system  requiring  a  model  for  which  an 
exact  solution  is  infeasible.  In  this  section  we  briefly  describe  the  approximate 
solution  for  the  model  obtained  using  the  techniques  presented  earlier  in  the 
thesis. 

The  model  analyzed  is  shown  in  figure  1.2.  It  contains  fifty  eight  service 
centers,  forty  customer  classes,  with  about  ten  customers  in  each  class  (there 
are  a  total  of  405  customers).  Each  customer  class  represents  the  workload  on 
a  different  microprocessor  based  terminal  controller.  This  detail  was  desirable 
since  the  model  wras  to  be  used  to  answer  questions  about  the  effects  of  the  dis¬ 
tribution  of  the  workload  among  the  microprocessors. 


An  exact  solution  to  the  model  renuires  time  and  snacc  on  the  order  of  iO40. 

x.  *. 

Therefore,  it  is  not  feasible  to  obtain  the  exact  solution.  In  such  situations  one 
must  resort  to  an  approximation  technique.  In  this  case,  because  the  central 
subsystem  is  represented  by  a  load  dependent  service  center,  the  MVA-based 
techniques  were  not  applicable.  This  means  that  one  of  the  aggregation  tech¬ 
niques  had  to  be  used.  Because  of  the  large  number  of  customer  classes  in  the 
model,  class  aggregation  was  necessary.  (Even  aggregating  all  customers  in 
every  class  into  single  customers  yields  an  aggregate  model  with  computational 
requirements  on  the  order  of  240,  which  is  clearly  unmanageable.)  We  therefore 
applied  class  aggregation,  combining  ail  classes  into  a  single  class.  This  tech¬ 
nique  requires  an  iteration  involving  the  exact  solution  of  a  network  with  about 
400  customers  in  a  single  cl  c*.  ^  s «  Tris  computational  requirement  is  therefore  on 
the  order  of  400  for  both  time  and  space  (for  each  iteration). 

The  solution  of  the  network  required  five  iterations  to  converge  so  that 
predicted  multiprogramming  levels  for  the  various  classes  were  within  approxi- 
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mately  .5%  of  their  exact  values.  The  totai  computational  expenditure  was 
therefore  on  the  order  of  (&)(400)  =  2000,  in  contrast  to  1040  for  the  exact  solu¬ 
tion. 

The  approximate  solution  of  the  model  showed  that  a  number  of  the 
transmission  lines  were  operating  very  near  saturation,  while  others  were  only 
moderatly  loaded.  In  addition,  some  of  the  microprocessors  were  also  nearly 
overloaded.  Using  the  approximate  solutions  obtained  by  class  aggregation,  the 
model  could  be  used  to  investigate  the  effects  of  various  proposed  modifications 
to  the  system.  For  instance,  traffic  on  the  transmission  lines  could  be  reduced 
by  performing  data  compression  before  transmission.  This  could  be  reflected  in 
the  model  by  reducing  the  service  times  for  the  transmission  lines,  and  increas¬ 
ing  the  service  times  for  the  various  processors  (which  experience  increased 
overhead  due  to  data  compression  and  expansion).  Changes  to  the  distribution 
of  the  workloads  assigned  to  each  microprocessor  could  be  modeled  simply  by 
changing  the  relative  throughputs  of  the  processors  involved.  Finally,  the  use  of 
additional  transmission  lines  could  be  modeled  simply  by  increasing  the  number 
of  service  centers  in  the  model,  and  altering  the  relative  throughputs  to  reflect 
the  redistribution  of  the  workload  over  the  new  lines.  In  each  case,  useful  infor¬ 
mation  about  the  effectiveness  of  the  proposed  change  could  be  obtained  using 
the  approximate  solution  techniques  presented  in  this  thesis. 

We  believe  that  the  TSE  model  demonstrates  the  value  of  the  aggregation 
techniques  presented  earlier.  Without  such  approximate  solution  methods  the 
analyst  would  be  forced  to  model  the  system  at  only  very  high  levels  of  abstrac¬ 
tion  (where  the  number  of  the  components  of  the  model,  might  be  kept  small). 
The  approximation  techniques  therefore  represent  valuable  aides  in  the  practi¬ 
cal  problem  of  modeling  large  computer  systems. 
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9.3  FUTURE  WORK 

The  central  focus  of  this  thesis  has  been  the  use  of  aggregation  as  an 
approximation  technique  for  separable  queueing  network  models.  This  approach 
involves  detailed  information  about  the  state  probabilities  of  the  networks 
involved.  An  advantage  of  dealing  with  such  low  level  Quantities  is  that  the  tech¬ 
niques  are  very  flexible.  A  disadvantage  is  that  they  are  perhaps  subject  to  a 
compounding  of  small  errors  at  the  state  space  level  to  produce  significant 
errors  at  the  performance  measure  level.  This  is  in  contrast  to  the  MVA-based 
approaches,  which  deal  only  with  performance  measure  information.  Indeed, 
the  relationship  between  the  two  approaches  is  much  like  the  relationship 
between  the  convolution  and  MVA  solution  algorithms;  in  both  cases  the  former 
is  a  more  detailed  and  therefore  more  flexible  approach,  while  the  latter  is 
simpler  and  more  restrictive. 

Among  the  questions  that  remain  to  be  answered  is  the  extent  to  which 
these  approaches  will  prove  useful  in  practice.  This  is  a  question  that  can  be 
answered  only  by  actual  implementation  and  use  of  the  methods  *. 

Both  the  aggregation  and  MVA  techniques  discussed  address  only  one  of  the 
problems  that  face  the  analyst  attempting  to  model  actual  computer  systems, 
that  of  how  to  solve  very  large  models.  As  the  uses  for  which  these  models  are 
employed  become  more  sophisticated,  the  restrictions  required  for  a  model  to 
be  separable  may  become  unacceptable.  The  solution  to  these  problems 
requires  either  approximate  solution  techniques  for  non-separable  networks 
(such  as  those  in  [Mane  73]  and  [Tripathi  79])  or  the  use  of  structure  in  separ¬ 
able  networks  to  represent  approximately  the  non-separable  component  (such 
as  was  done  In  [Sevcik  77]).  Much  work  in  both  areas  remains  to  be  done. 


The  MVA-based  approximation  has  been  used  by  hard  v.ith  good  success  for  throughputs  as 
part  of  a  larger  approximation  technique  used  to  model  VM/370  [Bard  73]. 
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APPENDIX  A  -  Notation 


Ratio  of  the  service  rate  at 
center  m  when  there  are  nm 
customers  there  to  the  rate 
when  there  is  one  customer. 

ar,m{7W.m) 

Ratio  of  the  service  rate  at 
center  m  when  there  are  j v>m 
class  r  customers  there  to  the 
rate  when  there  is  one  class  r 
customer. 

nm 

j  - 1 

Auxiliary  function. 

*/ 

nT,m 

A  ( rn  }  ^  1  n»  /*  *?  \ 

~  |  j.  ~r.7n  w  / 

;-l 

Auxiliary  function. 

er,m 

Relative  throughput  of  class  r 
customers  at  center  m. 
(These  can  be  thought  of  as 
the  number  of  visits  of  a  class 
r  job  to  center  m.) 

F(a) 

The  set  of  classes  of  QN 
represented  by  class  a  of  Qft . 

F(r) 

The  aggregate  class  of  Qft  to 
which  class  r  of  QN  belongs. 

00) 

Normalizing  constant  of  the 
network  with  population  ft. 

M 

The  number  of  service 

centers  in  the  network. 

N 

The  number  of  customers  in  a 
single  class  model. 

ft 

The  multiprogramming  level 
vector. 

Nr 

The  multiprogramming  level 
of  class  r. 

Qr.m 

The  mean  number  of  class  r 
jobs  in  service  or  in  the  queue 
at  center  m. 

QN 

The  large  queueing  network  to 
be  solved. 
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The  aggregate  network  used 
to  approximate  QN. 

R  The  number  of  classes. 

Srm  Mean  service  time  of  class  r 

jobs  at  center  m  at  each  visit. 


7 


r,ra 


Throughput  of  class  r  at 
center  m. 


U. 


r,m 


Utilization  of  center  m  by 
class  t  customers. 


Xrm  =  eTTn*Srm  Relative  utilization  of  class  r 

at  center  m. 

A  tilde  over  a  symbol  denotes 
the  approximation  to  the 
value  represented  by  the 
symbol  (e.g.,  Trm  is  the  ap- 
pr  oxinx  a  t  i  o  n  t  o  Tr  rn ) . 

A  hat  over  a  symbol  denotes 
the  meaning  of  that  symbol 
applied  to  the  aggregate  net¬ 
work  (e.g.,  Tr<m  is  the 
throughput  of  class  r  custo¬ 
mers  at  center  m  in  the  ag¬ 
gregate  network). 


APPENDIX  B 


Given:  A  queueing  network  with  two  load  independent  service  centers,  and  two 
classes  with  one  customer  each.  Let  Xrm  be  the  relative  utilization  of 
class  r  at  center  m. 

We  wish  to  show  that  the  total  utilization  of  devices  one  and  two  are  underes¬ 
timated  by  the  class  aggregation  technique;  i.e., 

^  1  (B. 

where  S(n)  is  the  normalizing  constant  of  the  aggregate  network  when  there  are 
n  customers  in  it.  Let  be  the  relative  utilization  of  center  rn  in  the  aggre¬ 
gate  network.  By  definition, 

%m  -  (^A.m  +  UB,m) 

=  XA>mG(B)  +XBtmG(A) 

G(Al3) 

where  G(rt)  is  the  normalizing  constant  of  the  full  network  with  population  ft. 
Therefore, 

5(1)  -?i+£2 

5(2)  J?f  +  +  J?| 

We  can  prove  (B.l)  by  showing  that 

O  ,  O  1>?  .OO  .  'T>2 
A  1  t  A  2  5  A  |  -r  g  t  A  2 

Expanding  this  expression  in  terms  of  the  Xrm  ~we  get  a  necessary  and  sufficient 

condition  for  this  to  hold 

2X4.  \XB,  \Xa,  %Xb  2  ~  %a,  \XB.  2  ~  As,  r^A,  2  ^  0 
Since  the  Xrm  are  relative  utilizations,  we  can  assume  that  Xjt  t  =  1  without  loss 

of  generality.  Two  conditions  now  hold;  either  class  B  visits  center  one,  or  it 

does  not. 

Case  1:  XB%  1  =  0,  which  implies  that  XBt%  >  0. 


5(0 

5(2) 
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Case  2: 


~Xn.  2  S  0 

■-B,  i  >  0,  so  assume  Xjj  -  =  1. 

~  X-B,  2  ~  2  ^  0 

Let  XA,z  =  (l+s)A's, 2=  ”1  ^  e. 

2(1+e)AT|>3  -  X l  2  -  (l+e)^r|i2  £  0 

- E^Ajj  2  ~  t)  Q.E.D. 


APPENDIX  C 


TheoreiiuThe  approximation  for  the  throughput  of  a  single  class,  load  indepen¬ 
dent,  balanced  network  obtained  by  aggregating  customers  into  one  is 
an  overestimate  of  the  true  throughput  (for  integer  p). 

Proof: 

Since  the  system  is  balanced,  we  assume  that  the  relative  utilization  vector  is  a 
vector  of  M  ones.  The  ratio  of  the  the  approximate  throughput  to  the  actual 


throughpu 

—  - 

t,  E,  is  given 

M  vN  /p  -  2 
l  M- 1 

by 

l/p 

«T~ 1 

I  ^ 

^  1 

V, 

£j  — 

M  +N  /p  -  1 

i/p 

M+N-Z 

l  AT-1 

M  —  l 

l/p 

N 

Af+N-1 

N+(M-l)p 

N 

We  wish  to  show  that  E  ^  1. 


We  observe  that  E-  1  for  p  =  1.  For  larger  p  it  is  sufficient  to  show  that 

l  /j) 


N 

i/p  +  i 

7 

E 

N+(M-l)(p  +  l) 

N  +  (M-l)p 

or 


1 

1/P  + 1 

1 

l  +  c(p  +1) 

1  +  cp 

i  /v 


where  c  =  -----  --  (so  c  >  0).  Since  both  sides  of  the  inequality  are  positive,  we 
can  raise  them  to  the  power  p(p  +  l)  and  maintain  the  relationship 


1 

P 

1 

I4*c(p  +1) 

14-  cp 

(l+c  (p  4-1))35  £  (1  4-cp)p4‘1 

Sfe](c(p  +  l))»s'st  +  1]  (cp) 

71  — 0  71=0 
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T>  *  .  Tht,! 


t  cn(pr+iy 


(p  +  1)! 


^dQn\{p-n) ^  ‘  *dQn\{p  +  l—n)\ 


cnpn 


Taking  a  term  by  term  comparison,  this  is  true  if 
_ Pj _ xnn,  _ (p  + 1)! 


n\(p  — n )! 
for'  1  £n£p. 


(p  +  l)n  ^ 
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^  (p  +  l-n) 
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(p+(l~7l))(p  +  l)n  1  ^  pn 


(p  +(l— n ))  Y, 

i=o 


n  —  1 

.  i 


pJ  s,  p 


71-2 

S 

3=0 


n-1 
■  J 


p:'vl  +  (l-n)  V 

j-0 


n-1 

•  3 


p~J  £  0 


Since  the  only  constant  term  of  the  above  expression  is  negative,  the  relation¬ 
ship  is  true  if  the  coefficient  of  each  power  of  p  is  negative,  i.e.,  if 


(n-1)!  +  (l-n  )(n  —  1)!  £  Q 


j\(n-l-j)\  (j  +  l)\(n-j-2)\ 
for  j  =  0, 


(l-n)  < 

(n-l-j)  '  (J  + 1) 


+ 


0 


(j  +  l)+(l-n)(n-l-j)  ^  0 
n  (2+j )  -  nr  ^  0 

This  expression  is  maximized  with  respect  to  j  when  j  is  at  its  maximum,  n-2. 
The  maximum  is  therefore  easily  seen  to  be  0,  so  the  theorem  is  proven. 
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A  COMPARATIVE  ANALYSIS  OF  SEVERAL  DISK  SCHEDULING  ALGORITHMS 
C.J.M.  'Turnbull,  September  1972 

PROJECT  SUE  AS  A  LEARNING  EXPERIENCE 

K. C.  Sevcik  ct  al,  September  1972 
[Proceedings  AFIPS  Fall  Joint  Computer  Conference, 
v.  41,  December  1972] 

A  STUDY  OF  LANGUAGE  DIRECTED  COMPUTER  DESIGN 
David  B.  Wortman,  December  1972 
[Ph.D.  Thesis,  Computer  Science  Department, 

Stanford  University,  1972] 

AN  APL  TERMINAL  APPROACH  TO  COMPUTER  MAPPING 
R.  Kvaternilc,  December  1972 
[M.Sc.  Thesis,  DCS,  1972] 

AN  IMPLEMENTATION  LANGUAGE  FOR  MINICOMPUTERS 
G.G.  Kalmar,  January  1973 
[M.Sc.  Thesis,  DCS,  1972] 

COMPILER  STRUCTURE 

W.M.  McKeernan,  January  1973 

[Proceedings  of  the  US  A- Japan  Computer  Conference,  1972] 


CSRG-24 


CSRG-25 


CSRG-26 


CSRG-27 


CSRG-28 


CSRG-29 

CSRG-30 

CSRG-31 

CSRG-32 

CSRG-33 

CSRG-34 

CSRG-35 

CSRG-36 

CSRG-37 


AN  ANNOTATED  BIBLIOGRAPHY  ON  COMPUTER  PROGRAM 
ENGINEERING 

J  D.  Gannon  (ed.),  March  1973 

THE  INVESTIGATION  OF  SERVICE  TIME  DISTRIBUTIONS 
Eleanor  A.  Lester,  April  1973 
[M.Sc.  Thesis,  DCS,  1973] 

PSYCHOLOGICAL  COMPLEXITY  OF  COMPUTER  PROGRAMS: 

AM  INITIAL  EXPERIMENT 
Larry  Weissman,  August  1973 

STRUCTURED  SUBSETS  OF  THE  PL/I  LANGUAGE 

Richard  C.  Holt  and  David  B.  Wortman,  October  1973 

ON  REDUCED  MATRIX  REPRESENTATION  OF  LR(k) 

PARSER  TABLES 

Marc  Louis  Joliat,  October  1973 

[Ph.D.  Thesis,  EE  1973] 

A  STUDENT  PROJECT  FOR  AN  OPERATING  SYSTEMS  COURSE 
B.  Czarnik  and  D,  Tsichritzis  (eds.),  November  1973 

A  PSEUDO-MACHINE  FOR  CODE  GENERATION 
Henry  John  Pasko,  December  1973 
[M.Sc.  Thesis,  DCS  1973] 

AN  ANNOTAED  BIBLIOGRAPHY  ON  COMPUTER  PROGRAM  ENGINEERING 
J.D.  Gannon  (ed.).  Second  Edition,  March  1974 

SCHEDULING  MULTIPLE  RESOURCE  COMPUTER  SYSTEMS 

E. D.  Lazowslca,  May  1974 
[M.Sc.  Thesis,  DCS,  1974] 

AN  EDUCATIONAL  DATA  BASE  MANAGEMENT  SYSTEM 

F.  Lochovsky  and  D.  Tsichritzis,  May  1974 
[INFOR,  14  (3),  pp. 270-270,  1976] 

ALLOCATING  STORAGE  IN  HIERARCHICAL  DATA  BASES 
P.  Bernstein  and  D.  Tsichritzis,  May  1974 
[Information  Systems  Journal,  v.  1,  pp.  133-140] 

ON  IMPLEMENTATION  OF  RELATIONS 
D.  Tsichritzis,  May  1974 

SIX  PL/I  COMPILERS 

D.B.  Wortman,  P.J.  Khaiat,  and  D.M.  Lasker,  August  1974 
[Software  Practice  and  Experience,  v.B,  n.3, 

Juiy-Sept.  1976] 

A  METHODOLOGY  FOR  STUDYING  THE  PSYCHOLOGICAL  COMPLEXITY 
OF  COMPUTER  PROGRAMS 
Laurence  M.  Weissman,  August  1974 
[Ph.D.  Thesis,  DCS,  1974] 
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-  CSRG-38  AN  INVESTIGATION  OF  A  NEW  METHOD  OF  CONSTRUCTING  SOFTWARE 
David  M.  Lasker,  September  1974 
[M.Sc.  Thesis,  DCS.  1974] 

CSRG-39  AN  ALGEBRAIC  MODEL  FOR  STRING  PATTERNS 


Glenn  PL  Stewart,  September  1974 
[M.Sc.  Thesis,  DCS,  1974] 

* CSRG-40 

EDUCATIONAL  DATA  BASE  SYSTEM  USER’S  MANUAL 

J.  KlebanofF,  F.  Lochovsky,  A.  Rozitis,  and 

D.  Tsichritzis,  September  1974 

*  CSRG-41 

NOTES  FROM  A  WORKSHOP  ON  THE  ATTAINMENT  OF 

RELIABLE  SOFTWARE 

David  B.  Wortman  (ed.),  September  1974 

*  CSRG-42  THE  PROJECT  SUE  SYSTEM  LANGUAGE  REFERENCE  MANUAL 
B.L.  Clark  and  F.J.B.  Ham,  September  1974 


*  CSRG-43 

A  DATA  BASE  PROCESSOR 

E.A.  Ozkarahan,  S.A.  Schuster  and  K.C.  Smith, 

November  1974  [Proceedings  National  Computer 

Conference  1975,  v.44,  pp. 379-388] 

*  CSRG-44 

MATCHING  PROGRAM  AND  DATA  REPRESENTATION  TO  A 

COMPUTING  ENVIRONMENT 

Eric  C.R.  Hehner,  Novemvcr  1974 
[Ph.D.  Thesis,  DCS,  1974] 

See  Computer,  Vol.9,  No. 9,  August  1976,  pp. 65-70. 

*  CSRG-45 

THREE  APPROACHES  TO  RELIABLE  SOFTWARE;  LANGUAGE  DESIGN, 
DYADIC  SPECIFICATIONS,  COMPLEMENTARY  SEMANTICS 

J.E.  Donahue,  J.D.  Gannon,  J.V.  Guttag  and 

J.J.  Horning,  December  1974 

CSRG-46  THE  SYNTHESIS  OF  OPTIMAL  DECISION  TREES  FROM 
DECISION  TABLES 


Helmut  Schumacher,  December  1974 

[M.Sc.  Thesis,  DCS,  1974;  CACM,  v.19,  n.6,  June  1976] 

*  CSRG-47 

LANGUAGE  DESIGN  TO  ENHANCE  PROGRAMMING  RELIABILITY 

John  D.  Gannon,  January  1975 
[Ph.D.  Thesis,  DCS,  1975] 

*  CSRG-48 

DETERMINISTIC  LEFT  TO  RIGHT  PARSING 

Christopher  J.M.  Turnbull,  January  1975 
[Ph.D.  Thesis,  EE.  1974] 

*  CSRG-49  A  NETWORK  FRAMEWORK  FOR  RELATIONAL  IMPLEMENTATION 
D.  Tsichritzis,  February  1975  [in  Data  Base  Description, 
Dongue  and  Nijsson  (eds.),  North  Holland  Publishing  Co.] 
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*  CSRG-50  A  UNIFIED  APPROACH  TO  FUNCTIONAL  DEPENDENCIES 

AND  RELATIONS 

P.A.  Bernstein,  J.R.  Swenson  and  D.C.  Tsiehritzis 
February  1975  [Proceedings  of  the  ACM  SIGMOD 
Conference,  1975J 

*  CSRG-51  ZETA:  A  PROTOTYPE  RELATIONAL  DATA  BASE  MANAGEMENT  SYSTEM 

M.  Drodie  (ed),  February  1975  [Proceedings  Pacific  ACM 
Conference,  1975] 

*  CSRG-52  AUTOMATIC  GENERATION  OF  SYNTAX-REPAIRING  AND 

PARAGRAPHING  1  PARSERS 
David  T.  Barnard,  March  1975 
[M.Sc.  Thesis,  DCS,  1975] 

*  CSRG-53  QUERY  EXECUTION  AND  INDEX  SELECTION  FOR  RELATIONAL 

DATA  BASES 

J.H.  Gilies  Farley  and  Stewart  A.  Schuster,  March  1975 

CSRG-54  AN  ANNOTATED  BIBLIOGRAPHY  ON  COMPUTER  PROGRAM 
ENGINEERING 

J.V.  Gut.tag  (ed.),  Third  Edition,  April  1975 

CSRG-55  STRUCTURED  SUBSETS  OF  THE  PL/1  LANGUAGE 

Richard  C.  Holt  and  David  B.  Wortman,  May  1975 

*  CSRG-56  FEATURES  OF  A  CONCEPTUAL  SCHEMA 

D.  Tsiehritzis,  June  1975  [Proceedings  Very  Large 
Data  Base  Conference,  1975] 

*  CSRG-57  MERLIN:  TOWARDS  AN  IDEAL  PROGRAMMING  LANGUAGE 

Eric  C.R.  Hchner,  July  1975 

sec  Acta  Informatica  Col.  10,  No. 3,  pp. 229-243,  1978 

CSRG-58  ON  THE  SEMANTICS  OF  THE  RELATIONAL  DATA  MODEL 
Hans  Albrecht  Schmid  and  J.  Richard  Swenson, 

July  1975  [Proceedings  of  the  ACM  SIGMOD  Conference,  1975] 

*  CSRG-59  THE  SPECIFICATION  AND  APPLICATION  TO  PROGRAMMING 

OF  ABSTRACT  DATA  TYPES 
John  V.  Guttag,  September  1975 
[Pii.D.  Thesis,  DCS,  1975] 

*  CSRG-60  NORMALIZATION  AND  FUNCTIONAL  DEPENDENCIES  IN  THE 

RELATIONAL  DATA  BASE  MODEL 
Phillip  Alan  Bernstein,  October  1975 
[Ph.D.  Thesis.  DCS,  1975] 

*  CSRG-61  LSI,:  A  LINK  AND  SELECTION  LANGUAGE 

D.  Tsiehritzis,  November  1975  [Proceedings  ACM 
SIGMOD  Conference,  1976] 
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*  CSRG-62 


CSRG-63 


CSRG-64 


CSRG-65 


CSRG-66 


CSRG-67 


*  CSRG-68 


CSRG-69 


*  CSRG-70 


*  CS.RG-71 


*  CSRG-72 


COMPLEMENTARY  DEFINITIONS  OF  PROGRAMMING  LANGUAGE 
SEMANTICS 

James  E.  Donahue,  November  1975 
[Ph.D.  Thesis,  DCS,  1975] 

AN  EXPERIMENTAL  EVALUATION  OF  CHESS  PLAYING  HEURISTICS 
Lazio  Sugar,  December  1975 
[M.Sc.  Thesis,  DCS,  1975] 

A  VIRTUAL  MEMORY  SYSTEM  FOR  A  RELATIONAL  ASSOCIATIVE 
PROCESSOR 

S.A.  Schuster,  E.A.  Ozkarahan,  and  K.C.  Smith, 

February  1976  [Proceedings  National  Computer 
Conference  1976,  v.45,  pp. 855-862] 

PERFORMANCE  EVALUATION  OF  A  RELATIONAL,  ASSOCIATIVE 
PROCESSOR 

E.A.  Ozkarahan,  S.A.  Schuster,  and  K.C.  Sevcik, 

February  1976  [ACM  Transactions  on  Database 
Systems,  v.  1,  n:4,  December  1976] 

EDITING  COMPUTER  ANIMATED  FILM 
Michael  D.  Tilson,  February  1976 
[M.Sc.  Thesis,  DCS,  1975] 

A  DIAGRAMMATIC  APPROACH  TO  PROGRAMMING  LANGUAGE 
SEMANTICS 

James  R.  Cordy,  March  1976 
[M.Sc.  Thesis,  DCS,  1976] 

A  SYNTHETIC  ENGLISH  QUERY  LANGUAGE  FOR  A  RELATIONAL 
ASSOCIATIVE  PROCESSOR 

L.  Kerschberg,  E.A.  Ozkarahan,  and  J.E.S.  F’acheco, 

April  1976 

AN  ANNOTATED  BIBLIOGRAPHY  ON  COMPUTER  PROGRAM 
ENGINEERING 

D.  Barnard  and  D.  Thompson  (eds.),  Fourth  Edition, 

May  1976 

A  TAXONOMY  OF  DATA  MODELS 

L.  Kerschberg,  A.  Klug,  and  D.Tsichritzis,  May  1976 
[Proceedings  Very  Large  Data  Base  Conference,  1976] 

OPTIMIZATION  FEATURES  FOR  THE  ARCHITECTURE  OF  A 
DATA  BASE  MACHINE 

E. A.  Ozkarahan  and  K.C.  Sevcik,  May  1976 

[ACM  Transactions  of  Database  Systems,  v.2,  n.4,  December  1977] 

THE  RELATIONAL  DATA  BASE  SYSTEM  OMEGA  -  PROGRESS  REPORT 
H.A.  Schmid  (ed.),  P.A.  Bernstein  (ed.),  B.  Arlow, 

R.  Baker  and  S.  Pozgaj,  July  1976 
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CSRG-73 


*  CSRG-74 


CSRG-75 


CSRG-76 


CSRG-77 


CSRG-73 


CSRG-79 


CSRG-BO 


CSRG-81 


CSRG-B2 


CSRG-B3 


CSRG-64 


CSRG-B5 


AN  ALGORITHMIC  APPROACH  TO  NORMALIZATION  OP 
RELATIONAL  DATA  RASP  SCHEMAS 
P.A.  Bernstein  and  C.  Beeri,  September  1976 

A  HIGH-LEVEL  MACHINE-ORIENTED  ASSEMBLER  LANGUAGE 
FOR  A  DATA  BASE  MACHINE 

E.A.  Ozkarahan  and  S.A.  Schuster,  October  1976 

DO  CONSIDERED  OD:  A  CONTRIBUTION  TO  THE  PROGRAMMING 
CALCULUS 

Eric  C.R.  Hehner,  November  1976 
Acta  Inforniatica  to  appear  1979 

SOFTWARE  HUT:  A  COMPUTER  PROGRAM  ENGINEERING 
PROJECT  IN  THE  FORM  OF  A  GAME 
J.J.  Horning  and  D.B.  Wortman,  November  1976 

[IEEE  Transactions  on  Software  Engineering,  v.SE-3,  n.4,  July  1977] 

A  SHORT  STUDY  OF  PROGRAM  AND  MEMORY  POLICY  BEHAVIOUR 
G.  Scott  Graham,  January  1977 

A  PANACHE  OF  DBMS  IDEAS 

D.  Tsichritzis  (cd.),  February  1977 

THE  DESIGN  AND  IMPLEMENTATION  OF  AN  ADVANCED  LALR 
PARSE  TABLE  CONSTRUCTOR 
David  IT.  Thompson,  April  1977 
[M.Sc.  Thesis,  DCS,  1976] 

AN  ANNOTATED  BIBLIOGRAPHY  ON  COMPUTER  PROGRAM 
ENGINEERING 

D.  Barnard  (ed.),  Fifth  Edition,  May  1977 

PROGRAMMING  METHODOLOGY:  AN  ANNOTATED  BIBLIOGRAPHY 
FOR  1FIP  WORKING  GROUP  2.3 

Sol  J.  Greenspan  and  J.J.  Horning  (eds.),  First  Edition,  May  1977 
NOTES  ON  EUCLID 

edited  by  W.  David  Elliot  and  David  T.  Barnard,  August  1977 

TOPICS  IN  QUEUEING  NETWORK  MODELING 
ediled  by  G.  Scott  Graham,  July  1977 

TOWARD  PROGRAM  ILLUSTRATION 
Edward  Yarwood,  September  1977 
[M.Sc.  Thesis,  DCS,  1974] 

CHARACTERIZING  SERVICE  TIME  AND  RESPONSE  TIME 

DISTRIBUTIONS  IN  QUEUEING  NETWORK  MODELS  OF  COMPUTER 
SYSTEMS 

Edward  D.  Lazowska,  September  1977 
[PAD.  Thesis,  DCS,  1977] 
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CSRG-86  MEASUREMENTS  OF  COMPUTER  SYSTEMS  FOR  QUEUEING 
NETWORK  MODELS 
Martin  G.  Kienzle,  October  1977 

[M.Sc.  Thesis,  DCS,  1977;  Proc.  Int.  Symp.  on  Modelling  and  Performance 
Evaluation  of  Computer  Systems,  Vienna,  1979] 

CSRG-87  ’OLGA’  LANGUAGE  REFERENCE  MANUAL 

B.  Abourbih,  H.  Trickey,  D.M.  Lewis.  E.S.  Lee, 

P.I.P.  Boulton,  November  1977 

CSRG-B8  USING  A  GRAMMATICAL  FORMALISM  AS  A  PROGRAMMING  LANGUAGE 
Brad  A.  Silverberg,  January  1978 
[M.Sc.  Thesis,  DCS,  1978] 

CSRG-89  ON  THE  IMPLEMENTATION  OF  RELATIONS:  A  KEY  TO  EFFICIENCY 
Joachim  W.  Schmidt,  January  1978 

C3RG-90  DATA  BASE  MANAGEMENT  SYSTEM  USER  PERFORMANCE 
Frederick  H.  Lochovsky,  April  1978 
[Ph.D.  Thesis,  DCS,  1979] 

CSRG-91  SPECIFICATION  AND  VERIFICATION  OF  DATA  BASE 
SEMANTIC  INTEGRITY 
Michael  Lawrence  Brodie,  April  1978 
[Ph.D.  Thesis,  DCS,  1978] 

CSRG-92  STRUCTURED  SOUND  SYNTHESIS  PROJECT  (SSSP): 

AN  INTRODUCTION 

by  William  Buxton,  Guy  Fedorkow,  with  Ronald  Baecker, 

Gustav  Ciamaga,  Leslie  Mezei  and  K.C.  Smith,  June  1978 

CSRG-93  A  DEVICE-INDEPENDENT, GENERAL-PURPOSE  GRAPHICS  SYSTEM 
IN  A  MINICOMPUTER  TIME-SHARING  ENVIRONMENT 
William  T.  Reeves,  August  1978 
[M.Sc.  Thesis,  DCS,  1976] 

CSRG-94  ON  THE  AXIOMATIC  VERIFICATION  OF 
CONCURRENT  ALGORITHMS 
Christian  Lengauer,  August  1978 
[M.Sc.  Thesis.  DCS,  1978] 

CSRG-95  PISA:  A  PROGRAMMING  SYSTEM  FOR  INTERACTIVE 
PRODUCTION  OF  APPLICATION  SOFTWARE 
Rudolf  Marty,  August  1978 

CSRG-96  ADAPTIVE  MICROPROGRAMMING  AND  PROCESSOR  MODELING 
Walter  G.  Rosocha 
[Ph.D.  Thesis.  EE,  August  1979] 

CSRG-97  DESIGN  ISSUES  IN  THE  FOUNDATION  OF  A  COMPUTER-BASED 
TOOL  FOR  MUSIC  COMPOSITION 
William  Buxton 

[M.Sc.  Thesis,  CSRG.  October  1978] 
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CSRG-98  THEORY  OF  DATABASE  MAPPINGS 
Anthony  C.  Klug 

[Pb.D.  Thesis,  DCS,  December  1978] 

CSRG-99  HIERARCHICAL  COROUTINES:  A  MECHANISM  FOR  IMPROVED 
PROGRAM  STRUCTURE 
Leonard  I.  Vanek,  February  1979 

CSRG-100  TOPICS  IN  PERFORMANCE  EVALUATION 
G.  Scott  Graham  (ed.),  July  1979 

CSRG-101  A  PANACHE  OF  DBMS  IDEAS  II 

F.H.  Lochovsky  (ed.).  May  1979 

CSRG-102  A  SIMPLE  SET  THEORY  FOR  COMPUTING  SCIENCE 
Eric  C.R.  Hehner.  May  1979 

CSRG-1 03  THE  CENTRALIZED  ALGORITHM  IN  DISTRIBUTED  SYSTEMS 
Ernest  J.H.  Chang 
[Ph.D.  Thesis,  DCS,  July  1979] 

CSRG-1 04  ELIMINATING  THE  VARIABLE  FROM  DIJKSTRA’S 
MINI-LANGUAGE 
D.  Hugh  Redelmeicr,  July  1979 

CSRG-1 05  A  LANGUAGE  FACILITY'  FOR  DESIGNING  INTERACTIVE 
DATABASE-INTENSIVE  APPLICATIONS 
John  Mylopoulos,  Philip  A.  Bernstein,  Harry  K.T.  Wong, 
July  1979 

CSRG-1 06  ON  APPROXIMATE  SOLUTION  TECHNIQUES  FOR 

QUEUEING  NETWORK  MODELS  OF  COMPUTER  SYSTEMS 
Satish  Kumar  Tripathi,  July  1979 

CSRG-107  A  FRAMEWORK  FOR  VISUAL  MOTION  UNDERSTANDING 
John  K.  Tsotsos,  John  Mylopoulos,  H.  Dominic  Covvey 
Steven  W.  Zucker,  DCS,  June  1979 

CSRG-1 08  DIALOGUE  ORGANIZATION  AND  STRUCTURE  FOR 
INTERACTIVE  INFORMATION  SYSTEMS 
John  Leonard  Barron 
[M.Sc.  Thesis,  DCS,  1900] 

CSRG-1 09  A  UNIFYING  MODEL  OF  PHYSICAL  DATABASES 
D.S.  Batory,  C.C.  Gotlieb,  April  1980 

CSRG-1 10  OPTIMAL  FILE  DESIGNS  AND  REORGANIZATION  POINTS 
D.S.  Batory,  April  1980 

CSRG-1 11  A  PANACHE  OF  DBMS  IDEAS  III 
D.  Tsichrilzia  (ed.),  April  I960 
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CSRG-1  12  TOPICS  IN  PSN  -  II:  EXCEPTIONAL  CONDITION 

HANDLING  IN  PSN;  REPRESENTING  PROGRAMS  IN  PSN: 
CONTENTS  IN  PSN 

Yves  Lcspcrance,  Byran  M.  Krarncr,  Peter  F;  Schneider 
April,  1900 


CSKG-1 13  SYSTEM-ORIENTED  MACRO-SCHEDULING 
C.C.  Gollicb  and  A.  Schonbach 
ay  1900 


\  /  . 
ivicA 


CSRG-1 14  A  FRAMEW CRK  FOR  VISUAL  MOTION  UNDERSTANDING 
John  Konstantine  Tsotsos 
[Ph.D.  Thesis,  DCS,  June  1980] 


CSRG-1  15  SPECIFICATION  OF  CONCURRENT  EUCLID 
James  R.  Cordy  and  Richard  C.  Holt 
July  1980 


CSRG-1 16  THE  REPRESENTATION  OF  PROGRAMS  IN  THE 

PROCEDURAL  SEMANTIC  NETWORK  FORMALISM 

Bryan  M.  Kramer 

[M.Se.  Thesis,  DCS,  1980] 


CSRG-1 17  CONTEXT-FREE  GRAMMARS  AND  DERIVATION  TREES  AS 
PRO  G  RAM  MI  N  G  TO  0  LS 
Volker  Linnemann 
September.  I960 


CSRG-1 18  S/'SL:  SYNTAX/SEMANTIC  LANGUAGE 
INTRODUCTION  AND  SPECIFICATION 
R.C.  Holt,  JiR.  Cordy,  D.D.  Wortman 
CSRG,  September  1980 

CSRG-1 19  PT:  A  PASCAL  SUBSET 
Alan  Rosselct 

[M.Se.  Thesis,  DCS,  October  1900] 


CSRG- 120  PTED:  A  STANDARD  PASCAL  TEXT  EDITOR  BASED  ON 
THE  KERN1GHANL  AJVD  PLAUGER  DESIGN 
Ken  Newman,  DCS 
October  1980 

CSRG  - 1 2 1  TERMINAL  CONTEXT  GRAMMARS 
Howard  W.  Triolccy 
[M.Se.  Thesis,  EE,  September  1980] 

CSRG- 122  THE  APPROXIMATE  SOLUTION  OF  LARGE  QUEUEING 
NETWORK  MODELS 
John  Zahorjan 

[Ph.D.  Thesis,  DCS,  August  1900] 
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